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RNA VIRUS-DERIVED PLANT EXPRESSION SYSTEM 

FIELD OF THE INVENTION 

The present invention relates to the plant, plant parts or plant cell cultures having a 
heterologous DNA encoding an RNA replicon for expressing a sequence of interest. The 
invention also provides a process of expressing a sequence of interest in plants, plant parts 
or plant cell cultures. The process and vectors provide the plant cells with an increased 
frequency of RNA virus-derived RNA replicon formation. Said heterologous DNA or part(s) 
thereof can be stably incorporated into the plant nuclear chromosomal or episomal DNA or 
transiently delivered. The invention also provides processes of Agrobacterium-mediated 
transformation of plants with (RNA) viral vectors or (RNA) viral replicons. 

BACKGROUND OF THE INVENTION 

Among plant transgene expression systems, expression of a transgene under the 
control of a heterologous promoter has been in use for several years. Apart from such 
conventional plant expression systems, virus-based expression systems can be used for 
rapid protein production in plants (for review see: Porta & Lomonossoff, 1996, MoL 
Biotechnol., 5* 209-221; Yusibov et aL, 1999, Curr.Top. Microbiol. Immunol., 240. 81-94) and 
are a powerful tool for functional genomics studies (Dalmay et aL, 2000, Plant Cell, 12, 369- 
379; Ratcliff et aL, 2001, Plant J., 25, 237-245; Escobar et aL, 2003, Plant Cell, 15, 1507- 
1523). Numerous publications and patents in the field describe systems based on DNA and 
RNA viral vectors (Kumagai et aL, 1994, Proc. Natl. Acad. ScL USA, 90, 427-430; Mallory et 
aL, 2002, Nature Biotechnol. 20. 622-625; Mor et aL, 2003, Biotechnol. Bioeng., 81; 430-437; 
US531 6931 ; US5589367; US5866785; US5491 076; US5977438; US5981 236; 
WO02088369; WO02097080; W09854342). The existing viral vector systems are usually 
restricted to a narrow host range in terms of their best performance and even the expression 
level of such vectors in their most favourable host is far below the upper biological limits of 
the system. An important issue of virus-based systems is the method of delivery of the viral 
replicon to a plant cell. The most broadly applied method of delivery for large-scale 
production (simultaneous production in many plants, e.g. in a farm field or a greenhouse) is 
the use of infectious copies of RNA viral vectors (Kumagai et aL, 1995, Proc. Natl. Acad. ScL 
USA, 92, 1679-1683). Because of a relatively high tendency of recombinant viral RNA 
vectors to lose the heterologous inserts during the cycles of their replication, the method 
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requires transcription of DNA templates in vitro, and as a result is inefficient and expensive. 
Another approach to solve the delivery problem could be the presence of a viral RNA 
replicon precursor in each cell of a transgenic plant, such that it can be released upon 
triggering the replication process by complementing a function of the viral vector (e.g. using 
helper virus - US5965794) or using other regulated switch systems (e.g. site-specific 
recombination - US6632980). 

Despite many publications in the field including patented technologies, there are still 
no large scale virus-based production systems that work with sufficient efficiency and yield 
for commercial high-yield production, predominantly due to two main reasons: 
Firstly, transient plant virus-based expression systems are generally restricted to specific 
hosts, which may not be suitable for large scale cultivation due to their susceptibility to 
environmental factors. Moreover, they are generally restricted to certain parts of a plant host, 
thus excluding most of the plant biomass from the production process and as a result 
minimizes the relative yield of the recombinant product per unit of plant biomass down to a 
level comparable to that achievable by a conventional transcription promoter in a transgenic 
plant; 

Secondly, attempts to scale up the virus-based production system by generating transgenic 
plant hosts having the viral replicon precursor stably integrated in each cell have not 
provided a solution either, in particular because of underperformance of said replicons in 
such position, "leakiness" of the gene of interest to be expressed from said replicon and lack 
of an efficient switch system for said vectors. Certain progress was achieved with PVX-based 
vectors by using suppressors of PTGS silencing as trigger of RNA replicon formation 
(Mallory et a/., 2002, Nature Biotechnol., 20, 622-625), but the system is still impractical, as 
there is no solution provided for an efficient control of the switch (PTGS suppressor) 
triggering viral vector replication. However, this system provided for an expression level of 
the GUS gene reaching 3% of total soluble protein (TSP), which is the best known so far for 
this type of system, but still no better than a conventional transgene expression system 
under control of a strong promoter. Another inducible system based on a plant tripartite RNA 
virus (Mori et a/., 2001, Plant J., 27, 79-86), Brome Mosaic Virus (BMV), gave a very low 
yield of the protein of interest (3-4 ug/g fresh weight), which is comparable with the yields 
provided by standard transcriptional promoters. 

The low expression levels achieved so far with plant expression systems are a major 
reason why these systems are hardly competitive with other expression systems like 
bacterial, fungal, or insect cell expression systems. Low expression levels give rise to very 
high downstream costs for protein isolation, and purification in a huge background of plant 



WO 2005/049839 



PCT/EP2004/012743 



3 

material. Therefore, costs for downstream processing quickly decrease, as the yield of the 
protein or product of interest per unit plant biomass increases. 

There is presently no large-scale plant transgene expression system the yield and 
efficiency of which would be sufficiently high to compete on the market with other large-scale 
expression systems like bacterial, fungal, or insect cell expression systems. Such a plant 
expression would have to fulfill the following criteria as good as possible: 

(i) high yield, including expression of the protein of interest in as many plant tissues as 
possible and in as many cells of said tissues; 

(ii) for preventing a deleterious effect of protein expression on plant growth, expression 
of the protein or product of interest should be switchable such that expression can be 
switched on at a desired point in time. 

(iii) the switching should be such that expression can be switched on simultaneously or 
nearly simultaneously in all tissues or cells of a plant and, at the same time, in all 
plants of a selected group of plants, e.g. in all plants of a selected lot of plants. 
Typically, the protein or product of interest accumulates in each cell producing said 
product or protein up to a certain point. During accumulation, however, degradative 
processes frequently set on that tend to reduce the yield or quality of the protein or 
product of interest. Therefore, there is an optimal point in time after switching on 
expression, where the product or protein of interest should be harvested. This optimal 
point in time should be reached in all tissues or cells of a plant and in all plants of a 
selected lot at the same time to make the overall process efficient and profitable. 

Therefore, it is an object of this invention to provide a transgenic plant, plant part, or 
plant cell culture for a high-yield plant expression system. It is another object to provide a 
process of transiently expressing a sequence of interest in a plant, plant part, or plant cell 
culture. It is another object of the invention to provide an efficient process of expressing one 
or more sequences of interest in a plant, plant part of plant cell culture, whereby said process 
can be used efficiently on a large scale. Further, it is an object of the invention to provide a 
method of controlling the expression of nucleic acid sequence(s) of interest in a plant, plant 
part, or plant cell culture, which is of improved ecological and biological safety. 
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GENERAL DESCRIPTION OF THE INVENTION 

The above objects are achieved by a transformed plant, plant part, or plant cell 
culture containing in cell nuclei a heterologous DNA having a sequence encoding an RNA 
replicon, said sequence being operably linked or linkable to a transcription promoter, 
wherein said sequence encoding an RNA replicon contains 

(i) sequences for replicon function of said RNA replicon, said sequences being derived 
from a sequence of a plant RNA virus, and 

(ii) a sequence of interest to be expressed from said RNA replicon, 

whereby said sequences for replicon function correspond to sequences of said plant RNA 
virus and exhibit at selected localities of said sequence of said plant RNA virus function- 
conservative differences from said sequence of said plant RNA virus. Said differences can 
cause an increased frequency of replicon formation compared to an RNA replicon not 
exhibiting said differences. Preferably, said plant, plant part, or cells of said plant cell culture 
are stably transformed with said heterologous DNA. 

Further, the above objects are achieved by a process of expressing a sequence of 
interest in a plant, plant part, or plant cell culture, comprising: 

(a) providing a plant, plant part, or plant cell culture containing in cell nuclei a 
heterologous DNA as defined above and 

(b) causing expression of said sequence of interest. 

Cells of said plant, said plant part, or said plant cell culture may be stably or transiently 
transformed with said heterologous DNA. 

The invention also provides a process of producing a transgenic plant stably 
transformed on a nuclear chromosome with a heterologous DNA as defined above, 
comprising transforming a plant or a plant part with a vector containing said heterologous 
DNA, selecting tissue of said plant containing on a nuclear chromosome said heterologous 
DNA, and regenerating a transgenic plant from said tissue. 

The invention further provides a process of transiently expressing a sequence of interest in a 
plant, plant part, or plant cell culture, comprising: 

transforming a plant, plant part, or plant cell culture with a heterologous DNA having a 
sequence encoding an RNA replicon operably linked or linkable to a transcription promoter, 
wherein said sequence encoding an RNA replicbncoritains 

(i) sequences for replicon function of said RNA replicon, said sequences being derived 
from a sequence of a plant RNA virus, 
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(ii) a sequence of interest, 

whereby said sequences for replicon function exhibit at selected localities of said sequences . 
of said plant RNA virus function-conservative differences from said sequence of said plant 
RNA virus. Said differences can cause an increased frequency of replicon formation 
compared to an RNA replicon not exhibiting said differences. 

The invention further provides a nucleic acid molecule for producing said plant, plant 
part, or plant cell culture of the invention or for carrying out the process of the invention. Said 
nucleic acid molecule is as defined in the claims and as further defined as described below 
with reference to the plant, plant part of plant cell culture and said process of the invention. 

When the inventors of the present invention introduced a heterologous DNA encoding 
RNA viral replicons in nuclear chromosomes of plants or plant parts for expressing a protein 
of interest encoded in said RNA replicon, they found that the frequency with which RNA 
replicons appeared in the cytosol was very low and occurred only in a small fraction of cells 
containing the heterologous DNA. Accordingly, the expression level of the protein of interest 
was also very low. Many possible reasons for this problem were considered including 
positional effects of the chromosome, unsuitable transcription regulatory elements, gene 
silencing, defective transport of the replicon from the nucleus to the cytosol, a deleterious 
effect of the sequence of interest on transcription, RNA processing or replicon function etc. It 
was then surprisingly found that certain A/T(U)-rich sequence portions in the replicon were 
responsible for the low frequency of replicon formation in the. cytosol. When the deleterious 
effect of said A/T(U)-rich sequence portions was suppressed, the frequency of replicon 
formation in the cytosol strongly increased, resulting in a strongly increased yield of the 
protein of interest. 

The efficiency of the present invention is such that a new dimension in plant 
expression systems is attained. The expression levels achievable with the present invention 
are such that expenditures for downstream processing (including separation and purification 
of the protein of interest) are low enough to make the process of the invention competitive 
with , other large-scale expression systems. In prior art expression systems using stably 
transformed plants, the expression level is low even if virus-based vectors are used, since 
replicons are produced in a small fraction of the cells. Replicons that spread in the plant 
cannot remedy this problem, as spreading is slow, notably over long distances. Therefore, 
expression does, not proceed uniformly in the plant and degradation of the protein of interest 
will already take place in some parts of the plant while in others protein expression has not 
even started. The invention allows to trigger expression uniformly throughout the plant. The 
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small fraction of cells that do not produce a replicon can be quickly invaded by replicons from 
neighbouring cells. The invention provides the first high-yield plant expression system that 
can be used on large scale. The invention even allows to produce two or more replicons in 
the same cell, whereby the probability of having both replicons in the same cells is still very 
high. Further, the efficiency of the expression system of the invention is such that the 
otherwise limiting plant specificity of RNA viruses is reduced. 

The improved efficiency as described above can be achieved in combination with 
stable transformation as well as with transient transformation of plants, plant parts, or plant 
cells. 

The (optionally stably) transformed plant, plant part, or plant cell culture and said 
nucleic acid molecule have a heterologous DNA encoding an RNA replicon. Said sequence 
encoding an RNA replicon contains 

(i) sequences for replicon function of said RNA replicon, said sequences being derived 
from a sequence of a plant RNA virus, and 

(ii) a sequence of interest to be expressed from said RNA replicon. 

Said sequence encoding an RNA replicon contains a sequence of interest to be 
expressed from said RNA replicon (ii). Said sequence of interest to be expressed may lead 
to formation of an RNA of interest, like an RNA for RNA interference for suppressing a 
function of said plant. Preferably, however, said sequence of interest codes for a protein of 
interest and contains regulatory sequences for translating said protein of interest e.g. from 
said RNA replicon or from subgenomic RNA of said RNA replicon. The sequence of interest 
may include a sequence coding for a targeting signal for targeting the protein of interest to a 
particular cell compartment or for secreting said sequence of interest. Amino acid sequences 
for separating said protein of interest from a targeting signal may also be encoded. Said 
sequence of interest is a sequence that is heterologous to any sequences of said plant RNA 
virus, i.e. the process of the invention does not comprise a case restricted to transformation 
of a wild-type plant RNA virus into plants or plant leaves. Thus, said protein of interest is not 
a protein encoded by said plant RNA virus from which said sequences for replicon function 
are derived. 

Said sequences for replicon function (i) of said RNA replicon correspond to 
sequences of said plant RNA virus inter alia in that the former may be a DNA copy of the 
latter. Said sequences for replicon function provide the RNA replicon with the function to 
replicate in the cytosol. Said sequences for replicon function typically code for one or more 
proteins involved in replication like an RNA-dependent RNA polymerase (replicase). Said 
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sequences for replicon function may further code for functions of an RNA replicon like one or 
more proteins involved in cell-to-cell or systemic spreading of an RNA virus in a plant like a 
movement protein or a coat protein. Said sequences for replicon function are preferably 
derived from a sequence of a plant RNA virus, since plant RNA viruses are an easily 
accessible source for replicon functions. "Being derived" means that said sequences for 
replicon function are essentially a DNA copy of the corresponding sequences of said RNA 
virus and said DNA copy makes up a part of said heterologous DNA contained or to be 
introduced in cell nuclei. "Being derived" further means that said sequences for replicon 
function are not an exact DNA copy of the corresponding RNA sequence of said RNA virus, 
but exhibit function-conservative differences as described below. Since said differences are 
function-conservative, said sequences for replicon function preferably code for proteins 
capable of carrying out replicon functions similarly as they do for said RNA virus. Such 
function-conservative differences may, however, result in quantitative differences in the 
functionality of the encoded viral proteins compared to a case where such function- 
conservative differences are absent In one embodiment, said heterologous DNA and said 
sequences for replicon function do not code for a protein required for long-distance 
movement like a coat protein (notably a tobamoviral coat protein). In another embodiment, 
said heterologous DNA lacks a movement protein. Thus, said sequences for replicon 
function of said heterologous DNA do not have to code for all functions of the RNA virus from 
which said sequences for replicon function are derived. 

Said sequences for replicon function exhibit at selected localities of said sequence of 
said plant RNA virus function-conservative differences relative to said sequence of said plant 
RNA virus, said differences causing an increased frequency of replicon formation compared 
to an RNA replicon not exhibiting said differences. Said differences are causal for said 
increased frequency of replicon formation in plant cells, once the overall process has been 
switched on (see below). The causal connection between the increased frequency of replicon 
formation and said differences can be tested experimentally by comparing the frequency of 
replicon formation between sequences for replicon function having said differences and 
sequences for replicon function not having said differences. Such an experimental 
comparison can be made e.g. by counting protoplasts expressing said sequence of interest 
as described in the examples. Preferably, a sequence of interest coding for an easily 
detectable reporter protein like green fluorescent protein (GFP) is used for this purpose. As 
further described below, it is also preferred to perform the experimental comparison with 
RNA replicons not capable of cell-to-cell spreading. 

Said function-conservative differences are introduced into said sequences for replicon 
function at selected localities of said sequence of said plant RNA virus. Said selected 
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localities are localities in sequences for replicon function of said plant RNA virus that are 
responsible for a low probability of an RNA replicon transcribed in the nucleus to appear in 
the cytosol as a functional replicon. Preferably, such selected localities have a high A/T(U)- 
content, i.e. a high A-content and/or a high T-content (a high U-content on RNA level), or 
have cryptic splicing sites, i.e. sequence portions that can be recognized by the nuclear 
splicing machinery as splicing sites. Said selected localities may be identified in an RNA 
virus on which an RNA replicon is based by analyzing the RNA profile of the RNA virus as 
exemplified below. Further, selected localities may be identified experimentally by analyzing 
the RNA formed in a plant cell after transformation with a heterologous DNA encoding an 
RNA replicon that does not exhibit said (function-conservative) differences according to the 
invention. This experimental analysis may be done by RT-PCR, preferably together with 
sequencing of the RT-PCR products. In the RT-PCR test, the replicase is preferably 
rendered dysfunctional e.g. by a frame-shift mutation in order to prevent RNA replicons 
reaching the cytoplasm from amplifying; such amplification may lead to contamination of 
RNA transcripts with wild type virus or to an overrepresentation of amplified RNA replicons in 
the cytoplasm. In this way, undesired splicing products that indicate splicing events 
destroying the RNA replicon may be identified. Further, the exact sites of undesired splicing 
may be identified and then remedied by introducing said function-conservative differences at 
said selected localities. 

Thus, the invention also provides a process of expressing a sequence of interest in a 
plant, plant part, or plant cell culture, wherein (A) a plant, plant part, or plant cell culture is 
provided with a heterologous DNA as defined herein but lacking said function-conservative 
differences, (B) testing RNA derived from said heterologous DNA for undesired splicing 
products in said sequences for replicon function (e.g. by RT-PCR), (C) identifying (e.g. in the 
sequence of a product of said RT-PCR), a selected locality as a locality of an undesired 
splicing event, (D) introducing a function-conservative difference (e.g. an intron) according to 
the invention into or near said selected locality identified in step (C) into the heterologous 
DNA of step (A) for producing said heterologous DNA of the invention, and expressing a 
sequence of interest in a plant, plant part, or plant cell culture according to the invention, e.g. 
from a plant stably or transiently transformed with said heterologous DNA of the invention. 

Said function-conservative differences cause an increased frequency of RNA replicon 
formation by suppressing the deleterious effect of said selected localities on said frequency 
of RNA replicon formation. Said function-conservative differences may comprise a reduction 
of a high A/U-content in said RNA replicon by reducing a high ATT content in said sequences 
for replicon function of said sequence encoding said RNA replicon. Said high A/U content 
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may be reduced by at least partial deletion or at least partial replacement by G/C bases (e.g. 
using the degeneracy of the genetic code), provided said differences are function- 
conservative. Further, cryptic splicing sites flanking A/U-rich regions of said sequences 
derived from a plant RNA virus may be removed. Such function-conserved differences may 
be introduced at one or at, preferably, several selected localities. 

Preferred function-conservative differences comprise the insertion of one or more 
introns, most preferably nuclear introns, or one or more sequences capable of forming 
nuclear introns near or within A/U-rich localities of said sequences being derived from 
sequences of said plant RNA virus. It has surprisingly been found that the introduction of 
introns at or near A/U-rich localities results in an increased frequency of RNA replicon 
formation. Several introns may be introduced and examples are given herein for various 
numbers of introduced introns. The effects of more than one intron are cumulative. Further, 
intron insertion may be combined with other function-conservative differences at other 
selected localities. 

Fig. 8 shows an example for the introduction of sequences capable of forming a 
nuclear intron, albeit in the gene of interest to be expressed. In the example of Fig. 8, the 
intron is formed from two intron halfs upon recombinase-catalyzed flipping of a part of said 
heterologous DNA. This principle may also be applied to sequences for replicon function of 
said RNA replicon. In an embodiment wherein two different RNA replicons are formed in the 
same cell, recombination between said two different replicons may result in the formation of 
an intron from two intron halfs present on different replicons. Further, an RNA replicon may 
be formed by recombination between two precursors, neither of which is a replicon. Also in 
this case, an intron may be assembled from two intron halfs derived from different precursor 
molecules. 

Said plant, said plant part, or said plant cell culture of the invention may be stably 
transformed with said heterologous DNA. Stably transformed means that they contain said 
heterologous DNA in cell nuclei such that said heterologous DNA is maintained in said cell 
nuclei as nuclear chromosomes are maintained. Said heterologous DNA may be contained in 
or may be an episomal element Preferably, however, said heterologous DNA is stably 
incorporated in a nuclear chromosome such that it can be inherited to progeny cells or 
progeny plants. Methods of producing plants; plant parts, or plant cell cultures that are stably 
transformed are known in the art of plant biotechnology. Such methods usually require the 
selection of transformants for stable transformation using a selective agent and a selectable 
marker gene. 
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Preferably, all cells of said plant, said plant part or said cell culture contain in their 
nuclei said heterologous DNA for giving a high yield of a protein of interest. More preferably, 
said cells contain said heterologous DNA stably integrated in a nuclear chromosome. In the 
case of a plant, this means that the plant is a transgenic plant. 

Said heterologous DNA having said sequence encoding said RNA replicon is 
operably linked or linkable to a transcription promoter. Alternatively, said sequence encoding 
said RNA replicon is operably linked or linkable to a transcription promoter. If said 
heterologous DNA or said sequence is operably linked to said transcription promoter, the 
transcription promoter is preferably a regulated promoter, like an inducible, tissue-specific or 
developmental^ regulated promoter in order to make expression of said sequence of interest 
regulatable. More preferably, said promoter is inducible or developmental^ regulated, which 
allows that expression is induced at a desired time or that expression is switched on when 
the plant reaches a defined developmental stage, respectively. For example, expression of 
said sequence of interest can be switched on in plant seeds if said promoter is a seed- 
specific promoter. The invention is of high value for providing specific tissues (e.g. seed 
tissues) with RNA replicons, whereby said tissues may be less suited for cell-to-cell spread 
of said replicon than e.g. leaf tissue. Most preferred are chemically regulated promoters, 
since they allow to switch on expression at will in all or in most tissues of a plant. Most 
importantly, chemically inducible promoters are the promoters of choice for large scale 
applications, since the chemical inducer can be applied to a large number of plants at the 
same time. Examples of regulated promoters are known in the art. Large-scale applications 
are applications where a chosen sequence of interest is expressed in many plants 
concomittantly. Such a large-scale application may be performed in a greenhouse. 

Said sequence encoding an RNA replicon may alternatively be operably linkable to a 
transcription promoter, which allows to switch on expression of said sequence of interest by 
operably linking said sequence encoding an RNA replicon with a promoter. There are several 
ways of reducing this embodiment to practice. One option is to separate, in said 
heterologous DNA, said sequence encoding an RNA replicon and said promoter by a 
sequence block that precludes an operable linkage between said promoter and said 
heterologous DNA. Said sequence block may be flanked by recombination sites such that 
said block can be cut out by a recombinase recognizing said recombination sites. Thereby, 
operable linkage for transcription of said sequence encoding an RNA replicon can be 
established and expression may be switched on. Another option is to have a portion of a 
sequence necessary for transcription (e.g. a promoter or promoter portion) in flipped 
orientation and flanked by recombination sites. Providing a suitable recombinase may flip 
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said sequence portion back in correct orientation, whereby an operable linkage can be 
established. 

In one embodiment of the invention, said sequence encoding an RNA replicon has 
one or more segments that code together for said RNA replicon, i.e. said RNA replicon is not 
encoded by one continuous DNA. Instead, said RNA replicon is encoded discontinuously by 
two or more segments, whereby said segments may be present on the same chromosome 
preferably contiguous to each other. Formation of said RNA replicon may then require 
rearrangement of said segments, e.g. by recombination. As an example, a part of a 
sequence for replicon function (e.g. a part of a sequence coding for a replicase) may be 
present in said heterologous DNA in a flipped orientation relative to other parts of such a 
sequence. The flipped part may be flanked by recombination sites. Then the transcript of the 
heterologous DNA wilt not be a replicon, since said replicon function cannot be provided (e.g. 
because the transcript does not code for a functional replicase). Providing a site-specific 
recombinase recognizing the recombination sites can flip one of said segments back such 
that a replicon function is encoded continuously. In this embodiment, providing the 
recombinase may function as a switch for switching on replicon formation and expression of 
a sequence of interest (see further below). This embodiment is preferably performed in 
connection with stably transformed plants, plant parts, or plant cell cultures. 

Alternatively, said segments may be present on different chromosomes. Formation of 
an RNA replicon will then require transcription of both segments and trans-splicing of both 
transcripts for assembling said RNA replicon. This embodiment may be used for quickly 
segregating the segments that encode together said RNA replicon in progeny plants or cells 
as described in detail in PCT/EP03/02986. 

The process of the invention may comprise said steps (a) and (b). Step (a) may 
comprise stable or transient transformation of a plant, plant part or plant cell culture with said 
heterologous DNA of the invention. As discussed above, stable transformation of a nuclear 
chromosome is preferred. Preferably, the process of the invention is a process of expressing 
a protein of interest encoded by said sequence of interest. Step (b) comprises causing 
expression of said sequence of interest, e.g. switching on said expressing. Various methods 
of causing or switching on expression have already been mentioned. Examples include 
inducing an inducible promoter operably linked to said heterologous DNA; bringing said 
heterologous DNA under operable linkage to a promoter using recombination; establishing 
continuous coding of a sequence for replicon formation using recombination etc. If a 
recombinase is used for switching on the process of the invention, said recombinase may be 
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provided to said plant, plant part or plant cell culture transiently, whereby said providing 
would act as a switch for step (b). Alternatively, said recombinase may be stably encoded in 
cells, and expressing of the recombinase under control of a regulated, preferably inducible, 
promoter. Inducing recombinase expression by inducing said promoter may then cause 
expression in step (b). In the case of transient transformation, step (b) may be automatically 
achieved by performing step (a). 

Preferably, the process of the invention is performed with many plants in parallel by 
providing many plants according to (a) and causing expression of said sequence of interest 
according to (b) with all plants in one step, e.g. by applying a chemical inducer for a 
chemically inducible promoter to all plants for example by spraying. 

In an important embodiment of the process of the invention, said plant or said plant 
part (e.g. leaves) are transiently transformed with said heterologous DNA of the invention for 
transient expression of said sequence of interest. The term "transient transformation" means 
the introduction of said heterologous DNA without selection of transformed cells for stable 
incorporation of said heterologous DNA into a plant chromosome. Transient transformation 
usually provides for transient expression of the gene(s) encoded by heterologous DNA. 
Transient transformation can be achieved by any of the transformation methods given below. 
However, it is preferably performed by iAgroftacfer/t/m-mediated transient transformation of 
T-DNA containing said heterologous DNA of the invention. A preferred method of 
>Agro6acfer/t/m-mediated transient transformation is agroinfiltration. Agroinfiltration (agro- 
inoculation) is most preferred. The highest fastest and highest expression levels of said 
sequence of interest can be obtained if entire plants (i.e. the parts above the soil including all 
leaves) are transformed by agroinfiltration. This can be achieved by dipping the plant upside 
down in the Agrobacterium suspension, application of vacuum, and fast release of the 
vacuum. 

In a preferred embodiment of said process of transiently expressing a sequence of 
interest, said sequence encoding an RNA replicon is operably linked to a transcriptional 
promoter, preferably a constitutive transcriptional promoter. In another preferred 
embodiment, said plant belongs to the genus Nicotiana and said sequences for replicon 
function are derived from a tobamovirus, preferably from tobacco mosaic virus. In a 
particularly preferred embodiment, tobacco plants including the stem and all leaves are 
transiently transformed by agroinfiltration. The latter embodiment can be used for large-scale 
applications of the process of the invention. In large-scale applications, said process is 
concomitantly applied to many plants (at least 5, preferably at least 10, more preferably at 
least 100 plants). 
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The present invention may in principal be applied to any plants for which infectious 
RNA viruses exist. Suitable plant/RNA virus pairs may be derived from the list of RNA 
viruses given below. Due to the very high efficiency of replicon formation according to the 
invention, the plant species specificity of plant viruses is far less pronounced when this 
invention is practiced. Similarly, the present invention may be used with RNA replicons 
based on any RNA virus. RNA viruses have generally evolved outside the cell nuclei of their 
host plants and will have selected localities that make a replicon based on such a virus 
inefficient when the replicon is produced inside cell nuclei, notably if the replicon is stably 
encoded in a nuclear chromosome. The invention can be applied to all RNA viruses, 
although the level of improvement may vary between different plant RNA viruses. The most 
preferred plant RNA viruses the invention may be based on are tobamoviruses, notably 
tobacco mosaic virus, and Potexviruses such as potato virus X. In the case of tobacco 
mosaic virus, it will generally be the coat protein that is replaced by said sequence to be 
expressed. The movement protein may be removed or replaced by a sequence to be 
expressed. Preferably, however, an RNA replicon derived from tobacco mosaic virus should 
code for the movement protein and have the coat protein be replaced by said sequence to be 
expressed. It is highly preferred that said heterologous DNA lacks at least one open reading 
frame of said plant RNA virus, like a coat protein or a movement protein. 

The major application of the present invention is the production of a protein of interest 
in plants, plant parts or plant cell cultures. Said protein of interest is encoded by said 
sequence of interest. Said sequence of interest is preferably heterologous to said plant RNA 
virus. In any event, said sequence of interest is not a sequence having or encoding functions 
of said RNA virus. 

If the process of the invention is performed in plants, plants that do not enter the 
human or animal food chain are preferred, like Nicotians species (e.g. Nicotiana 
benthamiana, Nicotiana tabacum). Plant parts are e.g. plant organs or specific tissues of 
plants like leaves or seeds. Herein, seeds are considered as plant parts if the process of 
invention is done in seeds growing or being attached to a parent plant. Seeds are, however, 
also considered to be plants, albeit in a certain developmental stage of a plant. Most 
preferably, the plants of the invention are sold or distributed as seeds, the seeds are grown 
to plants, and expression of said sequence of interest is induced or switched on at a desired 
point in said plants. 

Many plant species like Nicotiana tabacum or Beta . vulgaris have hitherto been 
impossible to transform with a viral vector or a replicon by way of Agrobacterium-medlated 
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transformation. It may be surmised that the reason for this impossibility was the activation of 
. plant defense mechanisms in response to a double challenge of the plant with two 
pathogens, namely Agrobacterium and the viral vector. It has now been found by the 
inventors that the use of highly diluted suspension of Agrobacteria for Agrobacterium- 
mediated transformation allows to achieve a higher transformation efficiency with viral 
vectors. Thus, the invention achieves a broad applicability of Agrobacterium~rr\ed\ated viral 
vector transformation to many plant species. The highly diluted suspension of Agrobacteria 
for this embodiment has a concentration of cells of said Agrobacteria corresponding to a 
calculated optical density at 600 nm of at most 0.04, preferably at most 0.01 , more preferably 
at most 0.004, and most preferably at most 0.001 , whereby said calculated optical densities 
are defined by an least 25-fold, preferably at least 100-fold, more preferably at least 250-fold, 
and most preferably at least 1000-fold dilution, respectively, of a suspension of said 
Agrobacteria of an OD at 600 nm of 1.0. The plant species most preferably transformed 
according to this embodiment is Nicotiana tabacum. 

The transformation efficiency of Agrobacterium-mediated (RNA) viral vector 
transformation can further be improved by using in T-DNA the heterologous DNA according 
to the invention. Thus, the invention provides a process of expressing a sequence of interest 
in a plant, plant part, or plant cell culture, comprising: 

transforming a plant, plant part, or plant cell culture with a suspension of Agrobacteria, said 
Agrobacteria containing in T-DNA a heterologous DNA having a sequence encoding a 
replicon (preferably an RNA replicon) operably linked or linkable to a transcription promoter, 
wherein said sequence encoding a replicon contains 

(i) sequences for replicon function of said replicon, said sequences being derived from a 
sequence of a plant virus (preferably a plant RNA virus), 

(ii) a sequence of interest to be expressed, 

whereby said suspension of Agrobacteria has a concentration of cells of said Agrobacteria 
corresponding to a calculated optical density at 600 nm of at most 0.04, preferably at most 
0.01, more preferably at most 0.004, and most preferably at most 0.001, whereby said 
calculated optical densities are defined by an least 25-fold, preferably at least 100-fold, more 
preferably at least 250-fold, and most preferably at least 1000-fold dilution, respectively,- of a 
suspension of said Agrobacteria of an OD at 600 nm of 1 .0. 

The inventors have found that this process not only decreases the likelihood that cells 
of said Agrobacterium strain spread in the environment, thus improving the biological safety 
of this process. This process also improves the protein expression efficiency presumably by 
decreasing the exposure and stress for said plant or said plant leaves upon infection with an 
Agrobacterium strain that is a pathogen for said plant. The inventors have surprisingly found 
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that the efficiency of the process increases, within certain limits, with decreasing 
concentration of the Agrobacteria suspensions used for transforming or transfecting plants or 
plant parts. Notably, the ability for celi-to-cell movement of the replicons generated in cells of 
said plant improves with decreasing concentration of these Agrobacteria suspensions. The 
reasons for this unexpected phenomenon has not yet been identified. It is speculated that 
this phenomenon is due to a response of the plant to the infection by Agrobacteria and that 
this response does not occur (or occurs to a lesser extent) at lower Agrobacteria 
concentrations. In prior art transformation processes using Agrobacteria, much higher 
concentrations of Agrobacteria are used, generally in the range of an OD at 600 nm of 0.5 to 
1.0. 

Said plant or said plant leaves are preferably infiltrated with a suspension of cells of 
said Agrobacterium strain, said suspension having a concentration of Agrobacterium cells 
obtainable by diluting a suspension of sells of said Agrobacterium strain of an OD (optical 
density) of 1.0 at 600 nm at least 25-fold, preferably at least 100-fold, more preferably at 
least 250-fold, and most preferably at least 1000-fold. Such dilutions thus lead to 
Agrobacteria suspensions having calculated OD values at 600 nm of at most 0.04, preferably 
at most 0.01, more preferably at most 0.004, and most preferably at most 0.001, 
respectively. 

This process of using Agrobacteria suspensions with calculated OD values below 
0.04 can be combined with other embodiments described in this invention. Infiltration or agro- 
infiltration may be defined as a transformation or transfection method using a suspension of 
Agrobacteria, wherein a pressure difference is used for pressing Agrobacteria into plant 
tissue (intercellular space). 



BRIEF DESCRIPTION OF THE FIGURES 

Fig. 1 depicts the general principle of the invention, based on increased frequency of RNA 
virus-based replicon formation. 

Fig. 2 shows the intron prediction profile of transcribed region of vector plCH8543. 
Nucleotide numbers are given on the horizontal axis. The vertical axis shows the probability 
for corresponding sequence/sequence region to be a coding sequence (coding), to serve as 
donor site (Donor) or as acceptor site (Acceptor). Circled parts correspond to selected 
localities where said function conservative differences should be introduced. 
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Fig. 3 shows the intron prediction profile of the first half of the transcribed region of vector 
plCH15466. The circled regions were modified (compare Fig. 2A) with function-conservative 
differences according to the invention. 

Fig. 4 shows the intron prediction profile of the second half of the transcribed region of 
plCH1590. The circled regions were modified (compare with Fig. 2B) with function- 
conservative differences according to the invention. 

Fig. 5 shows the intron prediction profile of the transcribed region of plCH15499. The circled 
regions correspond to six inserted plant nuclear introns. 

Fig. 6 A and B are schematic representations of the T-DNA regions of vectors with and 
without function-conservative differences according to the invention. 

Fig. 7 shows GFP expression after agroinfiltration of viral constructs in Nicotiana 
benthamiana and Nicotiana tabacum leaves. The vector (pICH) identification number for 
each infiltrated area is indicated. 

7A - Nicotiana benthamiana, 8 days after agroinfiltration; 
7B - Nicotiana tabacum, 8 days after agroinfiltration; 

7C - Nicothiana benthamiana protoplasts isolated 5 days after agroinfiltration. Many light 
spots in the right picture indicate an extremely high frequency of replicon formation and GFP 
expression. 

Fig. 8 is a schematic representation of an RNA virus-based replicon precursor designed 
according to the present invention, which gives zero expression level of the gene of interest 
(GFP, indicated by G) in the non-induced state. 

P - transcription promoter; T - transcription termination region; SM - selectable marker 
gene; Ac2 - promoter of Arabidopsis ACTIN2 gene; RdRP viral RNA-dependent RNA 
polymerase; MP - viral movement protein; NTR - viral 3' non-translated region. 
Fig. 9 shows an intron prediction profile for Arabidopsis thaliana meiosis-specific gene 
AtDMCI (GenBank Acc.No U76670), using the direct strand (+ strand). The intron-coding 
regions are circled. 

Fig. 10 (A, B) shows the prediction of potential problematic regions (circled) within the direct 

strand (+ strand) of Potato Virus X (PVX) genome (GenBank Acc. No. AF1 72259). 

Fig. 1 1 shows in A, B, and C the prediction of potential problematic regions (circled) of the 

direct strand (+ strand) of alfalfa mosaic virus genomes of RNA1 (GenBank Acc. No 

K02703), RNA2 (GenBank Acc. No K02702) and RNA3 (GenBank Acc. No L00163), 

respectively. 

Fig. 12 depicts T-DNA regions of constructs plCH12691 and plCH16888. 
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P - transcription promoter; T - transcription termination region; SM - selectable marker 

gene; Ac2 - promoter of Arabidopsis ACTIN2 gene; RdRP viral RNA-dependent RNA 

polymerase; MP - viral movement protein; NTR - viral 3' non-translated region. 

Fig. 13 shows leaves under UV light of different stably transformed N. benthamiana lines 

carrying the T-DNA regions of either plCH12691 or plCH16888. The leaves were agro- 

infiltrated with vectors (plCH10881 or plCH14313) providing integrase. 

Fig. 14 shows leaves of Beta vulgaris one week after agro-infiltration with plCH18711 at day 

light (left) and UV (right) illumination. Light .patches in the right photograph indicate GFP 

fluorescence. Introns (spotted boxes) in the construct shown at the bottom are numbered. 



DETAILED DESCRIPTION OF THE INVENTION 

We have surprisingly found that the incorporation of plant introns into certain regions 
of plant viral RNA vectors and removal or replacement of cryptic introns within sequences for 
replicon function can dramatically increase (at least x10 2 folds) the efficiency of the 
appearance of said RNA replicons in the cytoplasm of host plants. Such increase in 
efficiency was reflected in at least one easily measurable parameter: relative proportion of 
cells showing replication of said vector, e.g. in increased frequency of replicon formation. 
Such optimisation of initiation of RNA replicon formation led to the ability of synchronized 
switching on of expression of a sequence of interest in a whole plant, resulting in 
dramatically increased yield of recombinant protein of interest in shorter time than for a non- 
modified vector. 

Despite of publications concerning the increase of nuclear transgene expression by 
incorporation of introns in coding regions of recombinant DNA (Mascarenhas et ai, 1990, 
Plant Mol. B/o/., 15, 913-920; Bourdon et a/., 2001, EMBO Reports, 2, 394-398; Rose, AB., 
2002, RNA, 8, 1444-1453; US5,955,330), there is no hint in the prior art showing that 
incorporation of introns into viral RNA replicons would have any positive effect on the 
frequency of viral replicon formation and subsequently, on the level of expression of a 
sequence of interest provided by said replicon. This effect is surprising considering that 
nuclear mRNA transcription and viral RNA replication take place in different sub-cellular 
compartments. Even if the cDNA copy of a viral replicon is placed in the nucleus, only the 
first copy of the viral replicon precursor is produced in nucleus and then amplified in the 
cytoplasm under conditions different from those in the nucleus. In the prior art,, the use of 
introns for preventing the cytotoxic effect of "leaky" expression of viral genes in E.coli during 
cloning with wild type virus cDNAs was described (Johansen, I.E. 1996, Proc. Natl. Acad. 
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Sci. USA, 93,. 12400-12405; Yang et ai, 1998, Arch. Virol., 143, 2443-2451; Lopez-Moya & 
Garcia, 2000, Virus Res., 68, 99-107). There is no hint that intron inclusion can increase the 
frequency of replicon formation from a viral cDNA clone. The results obtained for wild type 
RNA viruses and their cDNA copies cannot be compared with virus-derived expression 
vectors designed for the expression of a heterologous sequence of interest in plants, 
predominantly at the expense of other properties of wild type viruses like high infectivity and 
stability of said viruses. Infectivity is not an issue in the present invention. Notably, infectivity 
is not an issue in a process of expressing a sequence of interest in a stably transformed 
plant. Infectivity of a viral DNA vector or its transcript is also not an issue when a plant is 
transformed with Agrobacteria containing the DNA vector in T-DNA. 

The present invention provides a method for increasing fundamentally the frequency 
of RNA virus-derived replicon formation, said replicons are derived upon transcription of DNA 
precursor and designed for the expression of sequences of interest. This method overcomes 
the limitations of existing viral vector-based expression systems, such as size limitation for 
heterologous sequences to be expressed and high instability of said vectors. Further, said 
method offers better biosafety characteristics, allows to design leakage-proof control over 
transgene expression (zero expression level in non-induced state), as such design can be an 
integrated part of the strategy for said RNA virus-derived replicon design. By providing high 
frequency of RNA virus-derived replicon formation, the approach described herein allows for 
a rapid initiation of the expression of a sequence of interest in a whole plant, part of plant or 
plant cell culture containing in cell nuclei a heterologous DNA encoding said RNA replicon. 
By practicing the invention, the performance of practically any plant RNA virus-derived 
replicon designed for the expression of a heterologous sequence of interest can be improved 
significantly through dramatic increase of the frequency of replicon formation. 

RNA viruses belonging to different taxonomic groups are suitable for constructing 
RNA replicons according to this invention. A list of RNA viruses to which this invention can 
be appliedjs presented below. Taxa names in quotes (and not in italic script) indicate that 
this taxon does not have an ICTV international approved name. Species (vernacular) names 
are given in regular script. Viruses with no formal assignment to genus or family are 
indicated): 

RNA Viruses: 

ssRNA Viruses: Family: Bromoviridae, Genus: Alfamovirus, Type species: alfalfa 
mosaic virus, Genus: llarvirus, Type species: tobacco streak virus, Genus: Bromovirus, 
Type species: brome mosaic virus, Genus: Cucumovirus, Type species: cucumber 
mosaic virus; 
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Family: Closteroviridae, Genus: Closterovirus, Type species: beet yellows virus, 
Genus: Crinivirus, Type species: Lettuce infectious yellows virus, Family: Comoviridae, 
Genus: Comovirus, Type species: cowpea mosaic virus, Genus: Fabavirus, Type 
species: broad bean wilt virus 1 , Genus: Nepovirus, Type species: tobacco ringspot virus; 

Family: Potyviridae, Genus: Potyvirus, Type species: potato virus Y, Genus: 
Rymovirus, Type species: ryegrass mosaic virus, Genus: Bymovirus 9 Type species: 
barley yellow mosaic virus; 

Family: Sequiviridae, Genus: Sequivirus, Type species: parsnip yellow fleck virus, 
Genus: Waikavirus, Type species: rice tungro spherical virus; Family: Tombusviridae, 
Genus: Carmovirus, Type species: carnation mottle virus, Genus: Dianthovirus, Type 
species: carnation ringspot virus, Genus: Machlomovirus, Type species: maize chlorotic 
mottle virus, Genus: Necrovirus, Type species: tobacco necrosis virus, Genus: 
Tombusvirus, Type species: tomato bushy stunt virus, Unassigned Genera of ssRNA 
viruses, Genus: Capillovirus, Type species: apple stem grooving virus; 

Genus: Carlavirus, Type species: carnation latent virus; Genus: Enamovirus 9 Type 
species: pea enation mosaic virus, 

Genus: Furovirus, Type species: soil-borne wheat mosaic virus, Genus: 
Hordeivirus, Type species: barley stripe mosaic virus, Genus: Idaeovirus, Type species: 
raspberry bushy dwarf virus; 

Genus: Luteovirus, Type species: barley yellow dwarf virus; Genus: Marafivirus 9 
Type species: maize rayado fino virus; Genus: Potexvirus, Type species: potato virus 
X;Genus: Sobemovirus, Type species: Southern bean mosaic virus, Genus: Tenuivirus, 
Type species: rice stripe virus, 

Genus: Tobamovirus 9 Type species: tobacco mosaic virus, 

Genus: Tobravirus, Type species: tobacco rattle virus, 

Genus: Trichovirus, Type species: apple chlorotic leaf spot virus; Genus: 
Tymovirus 9 Type species: turnip yellow mosaic virus; Genus: Umbravirus 9 Type species: 
carrot mottle virus; Negative ssRNA Viruses: Order: Mononegavirales, Family: 
Rhabdoviridae 9 Genus: Cytorhabdovirus, Type Species: lettuce necrotic yellows virus, 
Genus: Nucleorhabdovirus 9 Type species: potato yellow dwarf virus; 

Negative ssRNA Viruses: Family: Bunyaviridae 9 Genus: Tospovirus, Type 
species: tomato spotted wilt virus; 

dsRNA Viruses: Family: Partitiviridae, Genus: Alphacryptovirus, Type species: 
white clover cryptic virus 1, Genus: Betacryptovirus 9 Type species: white clover cryptic 
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virus 2, Family: Reoviridaei Genus: Fijivirus, Type species: Fiji disease virus, Genus: 
Phytoreovirus, Type species: wound tumor virus, Genus: Oryzavirusjype species: rice 
ragged stunt virus; 

Unassigned Viruses: 

Genome: ssRNA, Species Garlic viruses A V B,C,D, Species grapevine fleck virus, 
Species maize white line mosaic virus, Species olive latent virus 2, Species: ourmia melon 
virus, Species Pelargonium zonate spot virus. 

The general principle of the invention is shown in Figure 1. It is known that plant RNA 
viruses (an exception are viroids - small non-coding RNAs amplifying in plant cell nuclei - for 
a review see Diener, T.O., 1999, Arch. ViroL Suppl., 15, 203-220; Flores, R., 2001, CR Acad. 
Sci. III, 324 , 943-952) never occur in the plant nucleus, but in the cytoplasm. Therefore, the 
sequences of RNA viruses might not be adapted to withstand nuclear RNA processing 
events due to the presence of motifs that might be involved in complex series of processing 
steps including transport of processed RNA in cytoplasm, in which pre-mRNAs, rRNA and 
tRNA precursors are involved. The processing events, such as 5' end capping, splicing, 3' 
end generation, polyadenylation, degradation, base and sugar modification as well as editing 
(in plastids and mitochondria) are intensively studied. However, many elements of such 
events still remain unclear. The most dramatic changes to pre-mRNA in the nucleus happen 
during pre-mRNA splicing, the process by which intervening RNA sequences (introns) are 
removed from the initial transcript and exons are concomitantly ligated. Splicing is mediated 
by the splicesome, a complex structure comprising uridilate-rich small nuclear 
ribonucleoprotein particles. The splicesome carries out the splicing reaction in two 
consecutive steps: the first one - cleavage at the 5' splice site of upstream exon/intron 
junction leading to lariat formation, and second step - cleavage at the 3' splice site of 
intron/downstream exon junction followed by upstream and downstream exons ligation (for 
review see: Kramer, A, 1996, Annu.Rew. Biochem., 65, 367-409; Simpson.GG. & Filipowicz, 
W. 1996, Plant Mol. Biol., 32, 1-41). The 5' and 3' splice site dinucleotides (57GU; AG/3') 
flanking the intron sequences are highly conserved in higher plants and single G 
replacement might abandon the splicing activity at the site concerned. It is surprising that 
despite of a high conservation of splice sites between plants and animals, heterologous 
introns in plants are usually not spliced or spliced incorrectly (van Santen, VL. et al., 1987, 
Gene, 56, 253-265; Wiebauer, K., Herrero, J J., Filipowicz, W. 1988, Mol. Cel. Biol. s fL 2042- 
2051). Considering that plant viral RNAs were not under evolutionary pressure to resist 
nuclear RNA processing machinery, these RNAs are very likely to become subject of such 
processing, including splicing, once they are placed into the nuclear environment. This 
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situation is completely different from that of RNA transcripts encoded by nuclear genes, as 
the latter transcripts are evolutionary adapted to preserve their functionality, despite of series 
of RNA modifications taking place in the nucleus. However, such modifications can have 
dramatic consequences for viral RNA replicon formation. Re-engineering of the plant virus in 
order to make expression vectors for heterologous genes might further add to the instability 
of RNA virus-based replicons, as it would add further elements that might interact with RNA 
sequences of viral origin, producing defective RNA that is unable to replicate. Our invention 
addresses these problems by subjecting the expression vector to modifications that 
significantly increase the frequency of functional RNA replicon formation, when the 
expression vector is introduced as a DNA precursor into plants or plant cells to provide for 
transient expression or for stable integration into plant chromosomal DNA. We believe that 
modifications of virus-derived sequences shall be the most profound solution for increasing 
the efficiency of RNA virus-based replicons. In this invention we predominantly focus on 
modifications (said function-conservative differences) within the plant RNA virus derived 
sequences, as they are crucial for increasing the efficiency of RNA replicon formation. 

Surprisingly, our first attempt to find evidence that potentially problematic regions do 
exist, was successful and even more surprisingly, we obtained experimental confirmation by 
finding unexpectedly an improvement of orders of magnitude. An analysis of the sequence 
derived from the RNA virus of expression vector plCH8543 (EXAMPLE 1, Fig..6A) using the 
Netgenell server program (http://www.cbs.dtu.dk/services/NetGene2/) for the presence of 
cryptic introns and RNA splicing sites showed the presence of intron-like regions that might 
be spliced by the nuclear RNA processing machinery (see circled regions in Fig. 2). There 
are many other programs that can be used to identify potentially problematic regions (said 
selected localities) within plant viral RNA sequences, such as exon/intron prediction program 
(http://genes.mit.edu/GENSCAN.htmi) or splicing signal prediction program 
(http://125.itba.mi.cnr.it/-webgene/wwwspliceview.html) for variety of organisms. 

Considering that all existing programs are not ideal and are subject to mistakes, the 
potential problematic regions can also be determined experimentally. This can be done by 
analyzing the transcripts derived from a DNA vector under test in a nuclear environment with 
the help of such a routine technique as RT-PCR (Frohman, MA., 1989, Methods Enzymol., 
218, 340-356) or its more advanced version suitable for precise quantification of the 
concentration of different transcripts called real-time PCR (Gibson et al, 1996, Genome 
Res., 6, 995-1001), preferably followed by sequencing of the PCR-amplified products. The 
function-conservative differences of the invention change dramatically the RNA profile, for 
example by replacing intron-like sequences with exon-like ones, e.g. by introducing silent 
mutations with replacement of A/U-rich regions (intron-like) with G/C-rich regions (exon-like) 
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(see Figure 3, circled regions). Plant introns, unlike exons, are usually A/T(U) rich (Lorkovic, 
ZJ. et al., 2000, Trends Plant Sci., 5, 160-167; Brown, JW. & Simpson, CG. 1998, Annu. 
Rev. Plant Physiol. Plant Mol. 8/b/., 49, 77-95; Csank, C. et a/., 1990, Nucl. Acid Res., 18, 
5133-5141;Goodall & Filipowicz, 1989, Cell, 58, 473-483), but there are exceptions, for 
example when in monocotyledonous plants G/C rich introns were found (Goodall & 
Filipowicz, 1989, Cell, 58. 473-483; Goodall & Filipowicz, 1991, EMBO J., 10, 2635-2644). 
For practicing this invention, selected localities of high A/T(U) content include not only 
sequence stretches of at least 20 nucleotides in length with at least 55%, preferably at least 
65%, most preferably 80% or a higher of A/T(U) content, but also shorter stretches ("islands") 
of 6-19 nucleotides in a row of purely A/T(U)-containing sequences. Herein, localities of high 
A/U content include sequences which are more A- than U-rich, sequences which are A-rich, 
sequences which are more U- than A-rich, and sequences which are U-rich. Additionally, any 
transcribed sequence of interest can be tested for post-transcriptional mpdifications that 
cause a change in nucleic acids sequences (e.g. RNA splicing) by RT-PCR (Frohman, MA. 
1989, Methods Enzymol., 218, 340-356). It is a trivial task for those familiar with the art to 
use RT-PCR for detecting the regions within RNA that are subject to post-transcriptional 
modifications like deletions of sequences from the original RNA transcript. In EXAMPLE 2 we 
demonstrate that the modification of A/U rich region increases the number of GFP 
expressing cells at least 10-fold. This is clearly demonstrated in Fig. 7 by comparing the 
areas agroinfiltrated with plCH15466 (modified vector, Fig. 6A) and plCH14833 (control 
vector, Fig. 6A). Removing the movement protein (MP) allows for an accurate count of 
primary cells possessing functional RNA replicons, as cell-to-cell movement from the site of 
primary infection to neighbouring cells does not take place. In EXAMPLE 3, the modification 
of another U-rich intron-like region containing many cryptic splice sites (Fig.2B) and covering 
the subgenomic promoter of the movement protein (MP) was performed (Fig. 4, circled). This 
modification gave a dramatic effect on the increase of the frequency of replicon formation 
from viral vector plCH1590. As it was established in protoplast counting experiments 
(EXAMPLE 3), the increase was approximately 100-fold in comparison with unmodified 
vector pICH 14833 for both tested Nicotiana species - N. benthamiana and N. tobacco (see 
the corresponding infiltrated areas in Figure 7, A, B). In general, by using the approaches 
described in this invention, the frequency of RNA replicon formation can be increased 
approx. 300-fold, i.e. increasing the proportion of cells with functional replicons from about 
0.2% (control vector) to more than 50% (modified vector). We believe this is not the limit and 
reaching a frequency of 100% is very realistic. 

Such a high efficiency of replicon formation opens the door for expressing two or 
more different genes from two different RNA replicons within the same plant cell, e.g. co- 
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expressing different genes by using plant RNA virus based vectors. The achievement of 
synchronized release of two or more replicons at same time in the same cell is crucial for 
such co-expression, as the principle "first come, first served" is especially true for viral 
vectors. Systemic or cell-to-cell movement does not help, as different viral vectors do 
normally not overlap in their areas of spread or such overlap is insignificant. Simple 
calculations demonstrate the importance of the technology described in this invention for 
achieving co-expression of two sequences of interest in the same plant cell from two 
replicons. In the case of a non-optimised viral vector with a frequency of functional replicon 
formation of only 0.2% of all cells, the proportion of cells co-expressing two genes from two 
different RNA replicons will be 0.2x0.2=0.04%, while for the construct with increased 
frequency of functional RNA replicon formation (50% or 1/2 of all cells), said proportion of 
cells will be 0.5X0.5=0.25 or 25%, e.g. about 625-fold higher. With some of the best 
performing vectors (e.g. plCH16191, Fig. 7C) the proportion of cells having the functional 
replicon reaches ca. 90% (Fig. 7C, top right). This means that using such a vector for 
expressing two different sequences of interest from two independent replicons, co- 
expression can take place in about 80% of all cells. It appears very likely that the technology 
can be further improved and that 100% co-expression can be reached. 

It is worth to note that function-conserative differences in heterologous sequences of 
interest to be expressed from said RNA replicon might also be used to increase the 
frequency of RNA replicon formation, notably in combination with differences in sequences 
for replicon function. For example, modifications within said sequences of interest can be 
introduced that are necessary for formation and/or processing of said replicon. 

In an important embodiment of this invention, the frequency of replicon formation is 
improved by inserting nuclear introns in said sequences for replicon function (EXAMPLE 4). 
The incorporation of introns into the coding region of viral RNA-dependent RNA polymerase 
(RdRP) (EXAMPLES 4 and 8) resulted in a significant (at least 50-fold) increase in the 
frequency of replicon formation from (Fig. 7A,B) vectors carrying function-conservative 
differences as defined herein (plCH15034, plCH15025, pICH15499 in Fig. 6 A,B). The RNA 
profile for a vector containing 6 inserted introns from Arabidopsis is shown in Figure 5. In 
another example (EXAMPLE 7), insertion of introns in MP sequences increases the 
frequency of replicon formation at least 100 times. 

Many nuclear introns can be used to practice this invention. Examples of such introns 
include but are not limited to the introns from rice tpi Act1, and salT genes (Rethmeier et a/., 
1997, Plant J., 12, 895-899; Xu et aL, 1994, Plant Physiol., 100, 459-467; McElroy et a/., 
1990, Plant Cell, 2, 163-171); from the maize Adh1 f GapA1, actin and Bz1 genes (Callis et 
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a/., 1987, Genes Dev.. 1, 1183-11200; Donath et a/., 1995, Plant Mol. Biol, 28, 667-676; 
Maas et ai, 1991, Plant Mol. Biol., 16, 199-207; Sinibaldi &Mettler, 1992, in WE Cohn, K 
Moldave, eds, Progress in Nucleic Acids Research and Molecular Biology, vol 42, Academic 
Press, New York, pp229-257), from petunia rubisco gene SSU301 (Dean et ai, 1989, Plant 
Cell, 1^ 201-208), Arabidopsis A1 EF1a, UBQ10, UBQ3, PAT1 genes (Curie et a/.,1993, Mol. 
Gen. Genet 228. 428-436; Norris et a/., 1993, Plant Mol. Biol., 2± t 895-906; Rose & Last, 
1997, Plant J., 11,455-464) and many others. Synthetic introns can also be used for this 
invention. The smallest usable introns or their parts may be limited to splice donor and 
acceptor sites which usually flank the internal intron sequences. Preferably, the introns 
should have a size of at least 50 nt, more preferably a size of 100 to 200 nt, but actually 
there are no limitations regarding the . size of the introns. However, the size of the construct 
should be kept suitable for manipulations. The origin of the intron, its structure and size may 
be selected individually depending on the nature of the vector. Transient expression 
experiments may be used for testing the efficiency of a chosen intron or the corresponding 
intron parts. 

The modifications described above have a cumulative effect, e.g. if intron insertion(s) 
are combined with a modification of the MP subgenomic promoter, the increase in frequency 
of replicon formation can be approx. 300-fold (EXAMPLE 5). The preferred regions for intron 
insertions in order to have an increase in the frequency of RNA replicon formation are called 
selected localities herein. Such localities may contain "intron-like" structures. This is 
confirmed by the insertion of introns in MP, actually in close proximity to such a problematic 
region as the MP subgenomic promoter (EXAMPLE 7). A 100-fold increase in frequency of 
replicon formation was observed. Insertion of introns into "exon-like" regions does not have 
such a pronounced effect as insertion in said intron-like regions (EXAMPLE 6). 

The experiments discussed above were done with transient expression systems 
based on Agrobacterium-mediated DNA precursor delivery into plant cells. However, the 
most useful application of this invention will be for transgenic plants with a DNA precursor of 
said RNA replicon stably incorporated into a plant nuclear chromosome. This allows to 
overcome many limitations of plant viral vector-based systems, such as the restrictions to the 
maximal size of heterologous sequences viral vectors can tolerate. As the DNA precursor will 
be present in each cell of the transgenic plant, there is no absolute requirement for systemic 
movement or for cell to cell movement of the RNA replicon (replicon spreading). This can be 
compensated by the high efficiency of formation and transport of the RNA replicons of the 
invention into the cytoplasm. However, the ability of the vector for cell-to-cell movement can 
be of an additional value, as RNA replicon formation does not always occur in all cells. 



WO 2005/049839 



PCTYEP2004/012743 



25 

Different methods may be used for providing a plant cell with said heterologous DNA. 
Said vectors may be transformed into plant cells by a Ti-plasmid vector carried by 
Agrobacterium (US 5,591,616; US 4,940,838; US 5,464,763) or particle or microprojectile 
bombardment (US 05100792; EP 00444882B1; EP 00434616B1). Other plant transformation 
methods can also be used like microinjection (WO 09209696; WO 09400583A1;. EP 
175966B1), electroporation (EP00564595B1; EP00290395B1 ; WO 08706614A1) or PEG- 
mediated transformation of protoplasts etc. The choice of the method for vector delivery may 
depend on the plant species to be transformed. For example, microprojectile bombardment is 
generally preferred for monocot transformation, while for dicots, Agrobacterium-medlated 
transformation gives better results in general. 

In the examples described below, we used Agrobacterium-mediated delivery of 
vectors (said heterologous DNA) into Nicotiana cells. However, said vectors may be 
introduced into the plants in accordance with any of the standard techniques suitable for 
stable or transient transformation of the plant species of interest. Transformation techniques 
for dicotyledonous are well known in the art and include Agrobacterium-based techniques 
and techniques which do not require Agrobacterium. Non-Agrobacterium techniques involve 
the uptake of exogenous genetic material directly by protoplasts or cells. These techniques 
include PEG or electroporation mediated uptake, particle bombardment-mediated delivery 
and microinjection. Examples of these techniques are described in Paszkowski et a/., EMBO 
J 3, 2717-2722 (1984), Potrykus et a/., Mol. Gen. Genet 199,169-177 (1985), Reich et a/., 
Biotechnology 4:1001-1004 (1986), and Klein et a/., Nature 327,70-73 (1987). In each case, 
the transformed cells are regenerated to whole plants using standard techniques. 

Agrobacterium-mediated transformation is a preferred technique for the 
transformation of dicotyledons because of its high transformation efficiency and its broad 
utility with many different species. The many crop species which may be routinely 
transformed by Agrobacterium include tobacco, tomato, sunflower, cotton, oilseed rape, 
potato, soybean, alfalfa and poplar (EP 0 317 511 (cotton), EP 0 249 432 (tomato), WO 
87/07299 (Brassfca), U.S. Patent 4,795,855 (poplar)). 

Agrobacterium transformation typically involves the transfer of the binary 
vector carrying the foreign DNA of interest into an appropriate Agrobacterium strain which 
may depend on the complement of vir genes carried by the host Agrobacterium strain either 
on a co-resident plasmid or chromosomally (Uknes et a/., Plant Cell 5:159-169 (1993). The 
transfer of the recombinant binary vector to Agrobacterium may be accomplished by a 
triparental mating procedure using E. coli carrying the recombinant binary vector, a helper E. 
coli strain which carries a plasmid such as pRK2013, which is able to mobilize the 
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recombinant binary vector to the target Agrobacterium strain. Alternatively, the recombinant 
binary vector may be transferred to Agrobacterium by DNA transformation (Hofgen & 
Willmitzer, Nuci. Acids Res. 16, 9877 (1988)). 

Transformation of the target plant species by recombinant Agrobacterium usually 
involves co-cultivation of the Agrobacterium with explants from the plant following protocols 
known in the art. Transformed tissue carrying an antibiotic or herbicide resistance marker 
present between the binary plasmid T-DNA borders may be regenerated on selectable 
medium. This allows the generation of transgenic plants stably transformed on a nuclear 
chromosome with in T-DNA containing said heterologous DNA of the invention. 

In the examples of this invention, in parallel with stable agro-transformation we used 
agro-inoculation, a method of >Agrobacferium-mediated delivery of T-DNA for transient 
expression of gene(s) of interest (Vaquero et a/., 1999, Proc. Natl. Acad. ScL USA, 96, 
1 1 128-1 1 133). Agro-inoculation is an extremely useful tool not only for small-to-middle scale 
recombinant protein production systems, but as an element of a vector optimisation system, 
allowing to obtain fast results with different variants of constructs. 

The invention can also be used for large-scale/industrial production of recombinant 
proteins. Overnight cultures of Agrobacterium were used in our experiments. The overnight 
culture was prepared for agro-infiltration, as described in the prior art (Marillonnet et al., 
2004, Proc. NatL Acad. ScL USA., 101, 6853-6857). Usually, an overnight culture reaches an 
optical density (O.D.) of 3 - 3.5 units at a wavelength 600 nm and is diluted 3-5 times 
before agro-infiltration, yielding in general 5-9 x 10 9 colony forming units (Turpen et al., 
1993, J. Virol. Methods, 42, 227-240). We have found that a 10 2 , preferably a 10 3 and more 
preferably a 10 4 fold dilution of auch an overnight culture works very efficiently, especially in 
combination with sequences for replicon function having said function-conservative 
differences as described herein. Surprisingly, the vectors in infiltrated tobacco leaves further 
improved their performance giving better yield of GFP with increasing dilutions of the 
transforming Agrobacteria. For example, a 10 3 -fold dilution gave better result than a 10 2 -fold 
dilution. A 10 2 -fold dilution provides better GFP yield than a 10-fold dilution. A possible 
explanation for this phenomenon is the negative effect of highly concentrated Agrobacterium 
suspension on the function of a viral vector, e.g. on cell-to-cell movement, possibly as the 
result of a plant response to high concentrations of pathogenic bacteria. This phenomenon is 
of special value for large-scale industrial protein expression processes, as it allows to reduce 
the amount of agrobacteria required for recombinant protein production via agro-infiltration by 
at least one order of magnitude compared to prior art processes. 



WO 2005/049839 



PCT/EP2004/012743 



27 

In EXAMPLE 9 of this invention, a DNA precursor of an inactivated viral RNA-based 
replicon is stably incorporated into chromosomal DNA. Said replicon is optimised according 
to the invention. In addition, the replicon contains a structure preventing expression of the 
sequence of interest. Expression as well as formation of the functional RNA replicon can be 
triggered by flipping one part of the construct with the help of site-specific recombination. 
Said flipping can lead to the formation of two introns as well as to the assembly of a 
functional sequence of interest. The system described in EXAMPLE 9 shows not only the 
optimisation of a viral vector but also the solution for avoiding "leakiness" of constructs stably 
integrated into chromosomal DNA, including the "leaky" expression of the gene of interest 
from said construct. In many applications, it is crucial to have zero level expression in the 
uninduced state, especially for cytotoxic proteins or for achieving high biosafety standards 
with plant expression systems for expressing technical or pharmaceutical proteins. 

Transcription of the heterologous DNA and/or of said recombinase can be under the 
control of an inducible or any other regulated (e.g. developmental^ regulated) promoter. 
Inducible promoters can be divided into two categories according to their induction 
conditions: those induced by abiotic factors (temperature, light, chemical substances) and 
those that can be induced by biotic factors, for example, pathogen or pest attack. Examples 
of the first category are heat-inducible (US 05187287) and cold-inducible (US05847102) 
promoters, a copper-inducible system (Mett et al., 1993, Proc. Natl. Acad. Sci., 90, 4567- 
4571), steroid-inducible systems (Aoyama & Chua, 1997, Plant J., 11, 605-612; McNellis et 
al., 1998, Plant J., 14, 247-257; US06063985), an ethanol-inducible system (Caddick et al., 
1997, Nature Biotech., 16, 177-180; WO09321334), and a tetracycline-inducible system 
(Weinmann et al., 1 994, Plant J., 5, 559-569). One of the latest developments in the area of 
chemically inducible systems for plants is a chimaeric promoter that can be switched on by 
glucocorticoid dexamethasone and switched off by tetracycline (Bohner et al., 1999, Plant J., 
19, 87-95). For a review on chemically inducible systems see: Zuo & Chua, (2000, Current 
Opin. BiotechnoL, 1i 146-151) and Padidam, M (2003, Cum Opin. Plant Biol., 6, 169-177). 
Other examples of inducible promoters are promoters which control the expression of 
patogenesis-related (PR) genes in plants. These promoters can be induced by treatment of a 
plant with salicylic acid, an important component of plant signaling pathways in response to 
pathogen attack, or other chemical compounds (benzo-1 ,2,3-thiadiazole or isonicotinic acid) 
which are capable of triggering PR gene expression (US05942662). 

This invention is not limited to TMV-based vectors described in examples 1 - 9, but 
can be extended to replicons based on other plant RNA viruses. The analysis of other plant 
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viral RNA sequences (EXAMPLE 10, Figures 10, 11) shows selected localities very similar to 
those described for TMV and the sequences of pre-mRNA of plant nuclear genes (Fig. 9). 
This is strong evidence supporting the suggestion that, using the approaches described in 
this invention, practically any plant RNA virus-derived replicon can be improved 
fundamentally by removing/replacing problematic regions and/or inserting nuclear introns. 

The present invention is preferably carried out with higher multi-cellular plants, parts therof, 
or cell cultures thereof. Plants for the use in this invention include any plant species with 
preference given to agronomically and horticulturally important species. Common crop plants 
for the use in present invention include alfalfa, barley, beans, canola, cowpeas, cotton, corn, 
clover, lotus, lentils, lupine, millet, oats, peas, peanuts, rice, rye, sweet clover, sunflower, 
sweetpea, soybean, sorghum triticale, yam beans, velvet beans, vetch, wheat, wisteria, and 
nut plants. The plant species preferred for practicing this invention include, but not restricted 
to, representatives of Gramineae, Compositeae, Solanaceae and Rosaceae. 

Further preferred species for the use in this invention are plants from the following genera: 
Arabidopsis, Agrostis, Allium, Antirrhinum, Apium, Arachis, Asparagus, Atropa, Avena, 
Bambusa, Brassica, Bromus, Browaalia, Camellia, Cannabis, Capsicum, Cicer, 
Chenopodium, Chichorium, Citrus, Coffea, Coix, Cucumis, Curcubita, Cynodon, Dactylis, 
Datura, Daucus, Digitalis, Dioscorea, Elaeis, Eleusine, Festuca, Fragaria, Geranium, 
Glycine, Helianthus, Heterocallis, Hevea, Hordeum, Hyoscyamus, Ipomoea, Lactuca, Lens, 
Lilium, Linum, Lolium, Lotus, Lycopersicon, Majorana, Malus, Mangifera, Manihot, Medicago, 
Nemesia, Nicotiana, Onobrychis, Oryza, Panicum, Pelargonium, Pennisetum, Petunia, 
Pisum, Phaseolus, Phleum, Poa, Prunus, Ranunculus, Raphanus, Ribes, Ricinus, Rubus, 
Saccharum, Salpiglossis, Secale, Senecio, Setaria, Sinapis, Solanum, Sorghum, 
Stenotaphrum, Theobroma, Trifolium, Trigonella, Triticum, Vicia, Vigna, Vitis, Zea, and the 
Olyreae, the Pharoideae and many others. 

Most preferred plants for this invention are plants that do not enter the animal or 
human food chain like Nicotiana species, e.g. Nicotiana benthamiana and Nicotiana 
tabacum. 

Proteins of interest, their fragments (functional or non-functional) and their artificial 
derivatives that can be expressed in plants or plants cells using the present invention 
include, but are not limited to: starch modifying enzymes (starch synthase, starch 
phosphorylation enzyme, debranching enzyme, starch branching enzyme, starch branching 
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enzyme II, granule bound starch synthase), sucrose phosphate synthase, sucrose 
phosphorylase, polygalacturonase, polyfructan sucrase, ADP glucose pyrophosphorylase, 
cyclodextrin glycosyltransferase, fructosyl transferase, glycogen synthase, pectin esterase, 
aprotinin, avidin, bacterial levansucrase, E.coli glgA protein, MAPK4 and orthologues, 
nitrogen assimilation/methabolism enzyme, glutamine synthase, plant osmotin, 2S albumin, 
thaumatin, site-specific recombinase/integrase (FLP, Cre, R recombinase, Int, SSVI 
Integrase R, Integrase phiC31, or an active fragment or variant thereof), oil modifying 
enzymes (like fatty acids desaturases, elongases etc), isopentenyl transferase, Sea M5 
(soybean calmodulin), coleopteran type toxin or an insecticidally active fragment, ubiquitin 
conjugating enzyme (E2) fusion proteins, enzymes that metabolise lipids, amino acids, 
sugars, nucleic acids and polysaccharides, superoxide dismutase, inactive proenzyme form 
of a protease, plant protein toxins, traits altering fiber in fiber producing plants, Coleopteran 
active toxin from Bacillus thurlngiensis (Bt2 toxin, insecticidal crystal protein (ICP), CrylC 
toxin, delta endotoxin, polyopeptide toxin, protoxin etc.), insect specific toxin AalT, cellulose 
degrading enzymes, E1 cellulase from Acidothermus celluloticus, lignin modifying enzymes, 
cinnamoyl alcohol dehydrogenase, trehalose-6-phosphate synthase, enzymes of cytokinin 
metabolic pathway, HMG-CoA reductase, E. coll inorganic pyrophosphatase, seed storage 
protein, Erwinia herbicola lycopen synthase, ACC oxidase, pTOM36 encoded protein, 
phytase, ketohydrolase, acetoacetyl CoA reductase, PHB (polyhydroxybutanoate) synthase, 
enzymes involved in the synthesis of polyhydroxylalkanoates (PHA), acyl carrier protein, 
napin, EA9, non-higher plant phytoene synthase, pTOM5 encoded protein, ETR (ethylene 
receptor), plastidic pyruvate phosphate dikinase, nematode-inducible transmembrane pore 
protein, trait enhancing photosynthetic or plastid function of the plant cell, stilbene synthase, 
an enzyme capable of hydroxylating phenols, catechol dioxygenase, catechol 2,3- 
dioxygenase, chloromuconate cycloisomerase, anthranilate synthase, Brassica AGL15 
protein, fructose 1,6-biphosphatase (FBPase), AMV RNA3, PVY replicase, PLRV replicase, 
potyvirus coat protein, CMV coat protein, TMV coat protein, luteovirus replicase, MDMV 
messenger RNA, mutant geminiviral replicase, Umbellularia californica C12:0 preferring acyl- 
ACP thioesterase, plant C10 or C12:0 preferring acyl-ACP thioesterase, C14:0 preferring 
acyl-ACP thioesterase (luxD), plant synthase factor A, plant synthase factor B, D6- 
desaturase, proteins having an enzymatic activity in fatty acids biosynthesis and 
modifications, e.g. the peroxysomal p-oxidation of fatty acids in plant cells, acyl-CoA oxidase, 
3-ketoacyl-CoA thiolase, lipase, maize acetyl-CoA-carboxylase, etc.; 5-enolpyruvylshikimate- 
3-phosphate synthase (EPSP), phosphinothricin acetyl transferase (BAR, PAT), CP4 protein, 
ACC deaminase, protein having posttranslational cleavage site, DH PS gene conferring 
sulfonamide resistance, bacterial nitrilase, 2,4-D monooxygenase, acetolactate synthase or 
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acetohydroxyacid synthase (ALS, AHAS), polygalacturonase, Taq polymerase, bacterial 
nitrilase, many other enzymes of bacterial or phage origin including restriction 
endonucleases, methylases, DNA and RNA ligases, DNA and RNA polymerases, reverse 
transcriptases, nucleases (DNases and RNases), phosphatases, transferases etc. 

The present invention can be used for the purpose of molecular farming and 
purification of commercially valuable and pharmaceutically important proteins including 
industrial enzymes (cellulases, lipases, proteases, phytases etc.) and fibrous proteins 
(collagen, spider silk protein, etc.). Human or animal health protein may be expressed and 
purified using described in our invention approach. Examples of such proteins of interest 
include inter alia immune response proteins (monoclonal antibodies, single chain antibodies, 
T cell receptors etc.), antigens including those derived from pathogenic microorganisms, 
colony stimulating factors, relaxins, polypeptide hormones including somatotropin (HGH) and 
proinsulin, cytokines and their receptors, interferons, growth factors and coagulation factors, 
enzymatically active lysosomal enzyme, fibrinolytic polypeptides, blood clotting factors, 
trypsin, trypsinogen, a1 -antitrypsin (AAT), human serum albumin, glucocerebrosidases, 
native cholera toxin B, thrombin, human gastric lipase, granulocyte-macrophage colony 
stimulating factor (GM-CMF), serpin, lactoferrin, lisozyme, oleosin, prothrombin, alpha- 
galactosidase, as well as function-conservative proteins like fusions, mutant versions and 
synthetic derivatives of the above proteins. 

The content of Internationsl patent application PCT/EP03/1 2530 and European patent 
application 04016012.9 are incorporated herein by reference in their entireties. 
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EXAMPLES 

The following examples are presented to illustrate the present invention. Modifications and 
variations may be made without departing from the spirit and scope of the invention. 

EXAMPLE 1 

Construction of a TMV-based RNA vector 

Cloned cDNAs of the crucifer-infecting tobamovirus (cr-TMV; Dorokhov et ai. t 1994, 
FEBS Lett. 350 , 5-8) and of the turnip vein-clearing virus (TVCV; Lartey et a/., 1994, Arch. 
Virol. 138, 287-298) were obtained from Prof. Atabekov from Moscow University, Russia. A 
viral vector containing a green fluorescence protein (GFP) gene was made in several cloning 
steps. The resulting construct, plCH8543 (Fig. 6A), contains in sequential order a 787 bp 
fragment from the Arabidopsis actin 2 promoter (ACT2, ref An et al, 1996, GenBank 
accession AB026654, bp 57962 to 58748), the 5' end of TVCV (GenBank accession 
BRU03387, bp 1 to 5455), a fragment of cr-TMV (GenBank accession Z29370, bp 5457 to 
5677, with thymine 5606 changed to cytosine to remove the start codon of the coat protein, 
CP), sequences "taa teg ata act cga g", a synthetic GFP (sGFP) gene, cr-TMV 3' 
nontranslated region (3' NTR; GenBank accession Z29370, bp 6078 to 6312), and finally the 
nopaline synthase (Nos) terminator. The entire fragment was cloned between the T-DNA left 
(LB) and right (RB) borders of pICBVIO, a Carb R pBIN19-derived binary vector. plCH8543 
was transformed into Agrobacterium strain GV3101 and infiltrated into a Nicotiana 
benthamiana leaf. Foci of GFP fluoresence that appeared at 3 dpi grew and became 
confluent Surprisingly, even though most cells in the infiltrated area finally expressed GFP 
due to viral replication and movement, only a fraction of the cells initiated viral replication, as 
detected by a number of independent GFP expressing foci, it becamce clear that the limiting 
factor is not DNA delivery to plant cells, since infiltration of Nicotiana benthamiana leaves 
with a GFP gene under control of the 35S promoter leads to GFP expression in almost every 
cell in the infiltrated area (not shown). 

To confirm this observation, we made a viral vector construct containing a mutation in 
the MP. This construct, called plCH14833, is similar to plCH8543 but differs by a deletion of 
389 bp in the MP gene, upstream of the EcoRI site present in the MP. The sequence of the 
Ncol to EcoRI fragment that includes this deletion is given in the annex as SEQ ID No. 1. 
The entire viral construct (from the ACT2 promoter to the Nos terminator) was cloned 
between the T-DNA left and right borders of pICBV49, a pBIN19-derived Kan R binary vector. 
Due to the deletion in the MP, replicons produced from this construct cannot move from cell 
to cell but are able to replicate autonomously within a cell. Cell to cell movement can be 
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restored when MP is provided in trans, e.g. from a constitutive promoter such as the 
cauliflower mosaic virus 35S promoter. 

To make an MP expression construct, the TVCV MP gene was amplified by PGR 
from cloned TVCV cDNA (GenBank accession Z29370, bp 4802 to 5628) and subcloned in a 
binary vector under control of the 35S promoter. The resulting construct, called plCH10745 
(not shown), and plCH14833 were transformed into Agrobacterium strain GV3101 and 
various dilutions of an overnight culture were infiltrated in Nicotians benthamiana leaves as 
described by English and colleagues (1997, Plant J., 12, 597-603), except that the infiltration 
media lacked acetosyringone. Infiltration of plCH14833 alone led to the appearance of a few 
GFP expressing cells within the infiltrated area. By counting protoplasts prepared from the 
infiltrated area, we found that only one to three protoplasts expressed GFP from a total of 
500 protoplasts (0.2 to 0.6%). Coinfiltration of plCH14833 and plCH10745 led to the 
formation of GFP-expressing foci that grew from each initial GFP-expressing cell. Ultimately, 
due to cell-to-cell movement, a large proportion of cells in the infiltrated area expressed GFP 
(Fig. 7A). 

RNA viruses such as tobamoviruses replicate in the cytoplasm and never enter the 
nucleus. Therefore, they have evolved in an environment where they are not exposed to the 
nuclear pre-mRNA processing machinery. As a result, it is not surprising that RNA replicon 
transcripts generated in the nucleus from artificial viral constructs may not be recognized and 
processed properly by the RNA processing machinery. Moreover, RNA replicons from viral 
vectors are very large: approximately 7,000 nt in the case of the replicon based on TMV. 
Very few plant genes have such a large size and the majority of such genes contains introns 
that facilitate processing of the pre-mRNAs, export from the nucleus, and that improve the 
stability of the processed transcripts. We therefore hypothesized that modifications of the 
pre-mRNAs that would increase the efficiency of accurate processing and of export of 
correctly processed transcripts from the nucleus to the cytosol would lead to an increase of 
the number of cells that would initiate viral replication. It turned out that there are two 
approaches can be used to make RNA virus-based vectors that can more efficiently initiate 
viral replication after DNA delivery to the nucleus: (1) one approach is the removal of 
sequence features that might induce unwanted processing events (such as alternative 
splicing events using cryptic splice sites, or premature termination events); (2) a second 
approach is the addition of introns to increase the amount of properly processed transcripts, 
to improve export of the RNA from the nucleus to the cytoplasm, and/or to improve stability of 
the transcripts. 
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EXAMPLE 2 

Removal of intron-like sequences increases the frequency of viral RNA replicon formation in 
the cytoplasm 

We analyzed the sequence of the RNA replicon from plCH4351 using the Netgenell 
server program (http://www.cbs.dtu.dk/services/NetGene2/) and noticed several intron-like 
sequence features that might induce alternative splicing events. One such feature is a 0.6 kb 
uridine-rich region (corresponding to nt 827 to 1462 in GenBank accession BRU03387) at 
the beginning of the RdRP (Fig. 2A). This region was replaced in plCH14833 by a PCR- 
mutagenized sequence that differs from the original sequence by a 54 nucleotide substitution 
(sequence given in the annex as SEQ ID No. 2; cf. Fig. 3). The 52 nucleotide substitutions 
were made to replace T-rich sequences by more GC-rich sequences. All nucleotide 
substitutions were made silent so as not to change the RdRP protein sequence. This 
mutagenized fragment also contains two nucleotide substitutions (at position 829 and 1459; 
coordinates relative to GenBank accession BRU03387) that were introduced to remove 
putative cryptic splice donor and acceptor sites, respectively. To test the effect of these 
mutations, the resulting clone plCH15466 (Fig. 6A) was agroinfiltrated in N. benthamiana 
leaves with or without plCH10745 (movement protein in trans). Eight days after infiltration, a 
10-fold increase in the number of GFP expressing cells was observed in the area infiltrated 
with plCH15466 (compared to plCH14833, Fig. 7). This suggests that removal of intron-like 
sequences from the viral amplicon prevents unwanted alternative splicing events and results 
in more efficient initiation of viral replication. Coinfiltration of plCH15466 and plCH10745 
leads to cell-to-cell movement of the modified replicon at a similar speed as a non-modified 
replicon. This shows that the modification of the RNA sequence did not affect cell to cell 
movement of the viral vector. 

EXAMPLE 3 

Removal of intron-like sequences in the MP subgenomic promoter 

A second potentially problematic region corresponds to the MP subgenomic promoter 
(Fig. 2B). This region is very T-rich and resembles intron sequences very closely. As a 
consequence, many cryptic splice donor and acceptor sites are predicted in nearby 
sequences by intron prediction programs. Unfortunately, modifications cannot be made 
easily to this region without affecting subgenomic promoter function. We decided to 
completely mutagenize the entire region without regard for the subgenomic promoter, and to 
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provide MP in trans to compensate for the expected loss of MP expression. As MP will not be 
expressed from this construct, we also deleted most of MP sequence except for the 3' 
sequences that contain the CP subgenomic promoter which is required to drive expression of 
the gene of interest. We therefore replaced a 383 bp fragment in pICH 14833 (bp 4584 to 
5455 in GenBank accession BRU03387) by a 297 bp mutagenized fragment (SEQ ID No. 3). 
The resulting construct pICH 15900 (Fig. 6A) was agroinfiltrated in Nicotiana benthamiana 
leaves with or without pICH 10745. Interestingly, a huge increase in the number of cells 
initiating replication was detected in comparison to leaf areas infiltrated by pICH 14833. By 
counting GFP-expressing protoplasts prepared from infiltrated leaf areas, we estimate that 
this modification results in a 80 to 100-fold increase in the number of cells initiating viral 
replication compared to the unmodified plCH14833. plCH15900 was coinfiltrated with 
pICH 10745 (p35S-MP expression cassette) and an increase in GFP fluorescence was 
detected due to cell to cell movement. This increase was however very limited because so 
many cells already expressed GFP even in the absence of cell-to-cell movement. A 1000- 
fold dilution (corresponsing approximately to a calculated OD of 0.004 at 600 nm) of the 
agrobacterium suspension containing pICH 15900 coinfiltrated with a 5-fold diluted 
suspension of agrobacteria containing plCH10745 (corresponsing approximately to a 
calculated OD of 0.8 at 600 nm) gave rise to separate GFP expression foci. Fluorescent foci 
were as bright and of the same size as control foci obtained with pICH 14833. This tells us 
that the modification in plCH15900 and the delivery of MP in trans do not compromize 
functionality of the replicon regarding the level of replication, expression of the gene of 
interest and cell-to-ell movement. The same constructs (plCH14933 and plCH15900, with or 
without pICH 10745) were coinfiltrated to Nicotiana tabacum leaves. The modifications in 
plCH15900 lead to a similar increase in the number of cells initiating replication (in 
comparison to plCH14833) as they did in N. benthamiana. 

EXAMPLE 4 

Addition of introns improves the frequency of formation of functional RNA replicons in the 
cytoplasm 

We tested whether the addition of introns into viral pro-replicon sequences would 
increase the frequency of initiation of replication. Two constructs were made, plCH15025 and 
plCH15034 (Fig. 6A), each containing three different Arabidopsis thaliana introns in two 
different regions of the RdRP. plCH15025 was designed to contain introns in the middle of 
the RdRP, while p!CH15034 contains introns in the 3* end of the RdRP, upstream of the MP 
subgenomic promoter. The introns were amplified by PCR from Arabidopsis genomic DNA 
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and incorporated into viral sequences using PCR with primers overlapping the planned 
intron/exon junctions. The fragments containing the introns were subcloned into plCH1 4833 
as an Aval Hindlll fragment (SEQ ID No. 4 in the annex) to make plCH15025 or as a Pst1 
Ncol fragment (SEQ ID No. 5 in the annexe) to make plCH15034. 

Both constructs were separately agroinfiltrated into N. benthamiana leaves and 
compared to plCH14833. Both constructs significantly increased the number of cells initiating 
viral replication (Fig. 7A). This increase was estimated to be on the order of a 50-fold 
improvement relative to plCH14833. Both constructs were also coinfiltrated with an MP 
expressing clone, and cell-to-cell movement was found to be identical to clones without 
introns. Both constructs were also tested in N. tabacum, and a similar improvement was 
observed as in N. benthamiana (Fig. 7B). 

A third clone was made, plCH15499, which contained all 6 introns (Fig. 5, 6B, 7A, 
7B). This construct was tested in N. benthamiana and N. tabacum. This construct was more 
efficient than each individual construct with 3 introns, but the improvement was however less 
than additive. 

EXAMPLE 5 

Addition of introns and removal of intron-like sequences increases the frequency of the 
formation of functional RNA replicons in the cytoplasm 

Removing intron-like features and adding additional introns in one construct showed 
that both types of modifications can contribute to improve initiation of viral replication. We 
subcloned the 6 introns of plCH15499 into plCH15900, which contains the mutagenized MP 
subgenomic promoter region. The resulting clone plCH15860 (Fig. 6B) was infiltrated into N. 
benthamiana leaves and found to work significantly better than either parental clones within 
the range of approximately 50% to 90% of all protoplasts expressing GFP (Fig. 7). The best 
performing construct contains introns within the RdRP region and modified MP subgenomic 
promoter region (plCH16191 f Fig. 7C). In comparison to a clone without any modification, 
this represents an 80- to 300-fold improvement. This construct was also coinfiltrated with a 
MP-expressing construct (plCH10745) and it was found that the modifications did not 
compromize cell-to-cell movement or replication. 

EXAMPLE 6 

Not all intron additions increase the frequency of appearance of functional RNA replicons in 
the cytoplasm 
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We inserted two different Arabidopsis introns at the beginning of the RdRP, resulting 
in clone plCH15477 (the sequence of this region is shown as SEQ ID No. 6 in the annex). 
The sequence in this region already looks very "exon-like" (e.g. GC-rich without cryptic splice 
sites) before the addition of introns. No improvement on replication of viral initiation was seen 
with this construct. Therefore, not any addition of an intron will result in an improvement of 
the viral vector. It appears that the position chosen for intron insertion or mutagenesis is an 
important parameter. For example, all intron insertions or nucleotide substitutions that were 
made in regions near problematic structures such as the MP subgenomic promoter resulted 
in large improvements, while insertions of introns into sequences that are already "exon-like" 
did not. 

EXAMPLE 7 

Insertion of introns in MP sequences increase the frequency of viral replicon formation 

We first made a frameshift in the MP by digestion with the restriction enzyme Avrll, 
filling and religation. We then inserted two introns in the MP. The resulting clone pICH 16422 
(Fig. 6B) was infiltrated in Nicotiana benthamiana leaves. An about 100-fold increase in the 
number of cells containing the functional viral replicon was detected. 

EXAMPLE 8 

Insertion of introns into a MP containing vector improves the frequency of initiation of viral 
replication of autonomous functional clones 

A Kpn1 EcoRI fragment was subcloned from plCH15499 into plCH8543. The 
resulting clone, 16700 (Fig. 6B) contained a complete viral vector with 6 introns in the RdRP. 
This clone was infiltrated in N. benthamiana leaf and efficiently initiated replication. This 
clone was also able to move from cell to cell without the need to provide additional MP in 
trans. 

EXAMPLE 9 

Activation of an inactive replicon stably integrated on a chromosome 

It is also possible to stably transform intron-containing viral vector constructs in 
transgenic plants. To avoid deleterious viral replication that would inhibit plant growth, an 
inactive clone (pro-replicon) can be made by having a part of the vector present in antisense 
orientation (Fig. 8). Incorporation of recombination sites and of intron sequences at the 
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extremities of the inverted fragment allow this fragment to be 'flipped' in the correct 
orientation by using an appropriate recombinase. Recombination sites will be completely 
eliminated from the replicon by splicing. Introns in the pro-replicon allow efficient initiation of 
replication after recombination and transcription. In one specific example, the recombination 
sites are located within the gene of interest and downstream of the pro-replicon. Such a 
configuration prevents any gene expression before recombination. Other configurations can 
be considered where the recombination sites are located in other areas of the pro-replicon 
such as in the RdRP and upstream of the promoter. Intron sequences at the recombination 
site have the advantage of allowing to completely remove the recombination site from the 
replicon, but also increases the efficiency of viral replication, as described before. 

The flipped part can be located at the 3" end of the vector (as shown in Fig. 8), in the middle 
or at the 5' end, as shown in Fig. 12. Two constructs were made, plCH12691 (containing only 
one intron at the recombination site) and plCH16888 containing 6 additional introns in the 
RdRP. The sequence of the entire T-DNA region of plCH12691 is given in SEQ ID No. 7. 
plCH16888 is similar to plCH12691, but, in addition, contains the three introns described 
above in plCH15025 (SEQ ID No. 4) and the three introns described in plCH15034 (SEQ ID 
No. 5) inserted in the same position as in these constructs, respectively. Both plCH12691 
and pICH 16888 were stably transformed in Nicotians benthamiana using Kanamycin 
selection as follows. The constructs plCH12691 and plCH16888 were separately imobilized 
into A. tumefaciens (GV3101) and were separately used for v4grojbacfer/um-mediated leaf 
discs transformation of Nicotians plants as described by Horsh and colleagues (1985, 
Science, 227. 1229-1231) with minor modifications. Leaf discs were co-cultivated for 30 min 
in an agrobacterial suspension in Murashige and Skoog (MS) basal medium supplemented 
with 1 mg/L of alpha-naphthaleneacetic acid (NAA), 0.5 mg/L 6-benzaminopurine (BAP), 
200 microM acetosirengone (AS), pH5.5-5.6. Then leaf discs were placed on sterile 
Whatman® filter paper for removal of excessive liquid and transferred onto solid co-cultivation 
medium (0.8% agar prepared on MS supplemented as described above) for 48 hours 
cultivation in darkness at 22-23 Q C. After co-cultivation, leaf discs were placed on selective 
regeneration medium (0.8% agar prepared on MS supplemented with 1 mg/L BAP, 0.1 mg/L 
NAA, 1 mg/L MES (pH pH 5.7-5.8), 300 mg/L cefataxim, 50 mg/L kanamycin). After 3-6 
weeks of cultivation on regeneration medium, the shoots regenerated from kanamycin- 
resistant plant cells were transferred onto rooting selective medium (0.8% agar prepared on 
MS supplemented with 300 mg/L cefotaxim, 200 mg/L timentin to facilitate the" elimination of 
agrobacterium, 50 mg/L kanamycin, pH 5.7-5.8). Regenerated transformants were 
transferred to a glasshouse and tested by infiltration with a syringe without needle with an 
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agrobacterium suspsension containing an integrase expression construct (plCH10881: actin2 
promoter - PhiC31 integrase; or plCH14313: Zea maize transposable element Spm 
promoter- PhiC31 integrase). More pICH 16888 transformants exhibited viral replication foci 
after infiltration with the integrase construct than transformants of plCH12691 (Fig. 13). In 
addition, transformants of plCH16888 displayed more viral initiation foci per infiltration. 

EXAMPLE 10 

Plant viral RNA sequences contain potentially unstable regions 

The analysis of RNA profile of selected plant RNA viruses as well as one well 
characterised plant gene (AtDMCI) was performed by using the Netgenell server program 
(http://www.cbs.dtu.dk/services/NetGene2/). The RNA profile shown in Fig. 9 for AtDMCI 
clearly reflects the presence of 14 introns (circled), previously identified by comparing the 
cDNA and genomic DNA sequences. It is evident that RNA profiles of two plant viruses have 
regions (see the Figures 10, 11) which might cause problems for the stability of said RNA, if 
they are placed in plant nuclear environment. We have analysed the RNA profiles of several 
other representatives of plant RNA viruses (not shown), such as Brome Mosaic Virus, 
different strains of TMV, and many others. All of them have potential problematic regions that 
might compromise the efficiency of plant RNA virus-based replicon formation if delivered into 
the plant cell as DNA precursors. 

EXAMPLE 11 

Optimized vectors work in other species 

A fully optimized construct containing the mutagenized region (described in 
plCH15466) and 16 introns (including the six introns of plCH15860, the two introns of 
pICH 16422 and eight additional introns) was made. In summary this construct contains 
introns inserted at the following positions (given relative to TVCV sequence, GenBank 
accession BRU03387): nt 209, nt 828, nt 1169, nt 1378, nt 1622, nt 1844, nt 2228, nt 2589, 
nt 2944, nt 3143, nt 3381 , nt 3672, nt 3850, nt 4299, nt 5287, nt 5444. 

This construct was tested for expression in Beta vulgaris. Infiltration of the entire plant 
was performed as described next. Agrobacteria carrying plCH18711 were inoculated to 300 
ml" of LB containing 50 pg/ml Rifampicin and 56 [\g/rr\\ Kanamycin (selection for the binary 
vector) and grown until saturation. The bacteria were pelleted at 4800 g for 10 min and 
resuspended in 3 I of infiltration buffer (10 mM MES pH 5.5, 10 mM MgS0 4 ) in order get a 
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10-fold dilution relative to the saturated Agobacterium culture. A beaker containing the 
infiltration solution was placed in an exsiccator (30 mm diameter), with the aerial parts of a 
plant dipped in the solution. A vacuum was applied for two minutes using a Type PM 16763- 
860.3 pump from KNF Neuberger (Freiburg, Germany), reaching from 0.5 to 0.9 bar. The 
plants were returned to the greenhouse under standard conditions. 

GFP expression was high in leaves of the plants infiltrated with plCH18711 (Fig. 14). 
In contrast, only a few small spots could be seen in control plants infiltrated with plCH16700 
containing no intron (not shown). 



ANNEX 

SEQ ID No. 1 (Ncol-EcoRI fragment of plCH1 4833): 

ccatggacaaagtgataaaggcagctttttgtggagacgatagcctgatttacattcctaaaggtttagacttgcctgatattcaggc 

gggcgcgaacctcatgtggaacttcgaggccaaactcttcaggaagaagtatggttacttctgtggtcgttatgttattcaccatgat 

agaggagccatlgtgtattacgatccgcttaaactaatatctaagttaggttgtaaacatattagagatgttgttcacttagaagagtta 

cgcgagtcmgtgtgatgtagctagtaacttaaataattgtgcgtattmcacagttagatgaggccgttgccgaggttcataagacc 

gcggtaggcggttcgmgcttmgtagtataattaagtaffigtcagat 

gtctcgtacgaacctaaggtgagtgatttcctcaatctttcgaagaaggaagagatcttgccgaaggctctaacgaggttagaattc 
SEQ ID No. 2 (part of plCH1 5466): 

ggagataacctgagcttcttcttccataatgagagcactctcaattacacccacagcttcagcaacatcatcaagtacgtgtgcaag 

acgttcttccctgctagtcaacgcttcgtgtaccacaaggagttcctggtcactagagtcaacacttggtactgcaagttcacgagag 

tggatacgttcactctgttccgtggtgtgtaccacaacaatgtggattgcgaagagttttacaaggctatggacgatgcgtggcacta 

caaaaagacgttagcaatgcttaatgccgagaggaccatcttcaaggataacgctgcgttaaacttttggttcccgaaagtgaga 

gacatggttatcgtccctctctttgacgcttctatcacaactggtaggatgtctaggagagaggttatggtgaacaaggacttcgtcta 

cacggtcctaaatcacatcaagacctatcaagctaaggcactgacgtacgcaaacgtgctgagcttcgtggagtctattaggtcta 

gagtcataattaacggtgtcactgccaggtctgaatgggacacagacaaggcaattctaggtccattagcaatgacattcttcctga 

tcacgaagctgggtcatgtgcaagat 

SEQ ID No. 3 (part of pICH1 5900): 

gcggacgatacgtgatccaccatgatagaggagccattgtgtattacgatccgcttaaactaatatctaagctcggctgcaagcac 
atcagagacgtcgtgcacttagaagagttacgcgagtctttgtgcgacgtagctagtaacttgaacaactgcgcctacttctcacag 
ttagatgaggccgttgctgaggtccaraagactgcggtcggaggctcc^ 
aggctgttcagggacctgttcttcgtctgagttgacg 
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SEQ ID No. 4 (part of plCH15025): (contains 3 Introns shown underlined in italics) 
Cccgagctatactgtaccttcgccgaccgattggtactacagtacaagaaggcggaggagttccaatcgtgtgatctttccaaacc 
tctagaagagtcagagaagtactacaacgcattatccgagctatcagtgcttgagaatctcgactcttttgacttagaggcgtttaag 
actttatgtcagcagaagaatgtggacccggatatggcagcaaag 

gtaaatcctggtccacacmacgataaaaacacaagattttaaactatgaactgatcBataatcattcctaaaagaccacacmg 
fflgtttctaaagtaatttttactgttataacag 

gtggtcgtagcaatcatgaagtcagaattgacgttgcctttcaagaaacctacagaagaggaaatctcggagtcgctaaaacca 

ggagaggggtcgtgtgcagagcataaggaagtgttgagcttacaaaatgatgctccgttcccgtgtgtgaaaaatctagttgaag 

gttccgtgccggcgtatggaatgtgtcctaagggtggtggtttcgacaaattggatgtggacattgctgatttccatctcaagagtgta 

gatgcagttaaaaagggaactatgatgtctgcggtgtacacagggtctatcaaagttcaacaaatgaagaactacatagattactt 

aagtgcgtcgctggcagctacagtctcaaacctctgcaag 

ataaaaaatcaaaaaafficcacaataatccctcttttffigffictctaaffic^^ 

atgcatatttgtgatgagacaaatgtttgttctatgttttag 

gtgcttagagatgttcacggcgttgacccagagtcacaggagaaatctggagtgtgggatgttaggagaggacgttggttacttaa 

acctaatgcgaaaagtcacgcgtggggtgtggcagaagacgccaaccacaagttggttattgtgttactcaactgggatgacgg 

aaagccggtttgtgatgagacatggttcagggtggcggtgtcaagcgattccttgatatattcggatatgggaaaacttaagacgct 

cacgtcttgcagtccaaatggtgagccaccggagcctaacgccaaagtaattttggtcgatggtgttcccggttgtggaaaaacga 

aggagattatcgaaaag 

ataaattctacaffiaattatgctccttacamaaatattcQtcactcttcca^^ 

ttgcctcagttctaactgtttgtggtatttttgttttaattattgctacag 

gtaaacttctctgaagacttgattttagtccctgggaaggaagctt 

SEQ ID No. 5 (part of plCH15034): (contains 3 Introns shown underlined in italics) 
ctgcag 

ataaaatattgaatgccagacaatattcfficffiaaffigtaacmcctgtcaagatcaataaafflattt^ 
aattttttffiggagccattatataafflcctaattaactgaaccaaaattatacaaaccacr 

gtttgctggaaaatttggttgcaatgatcaaaagaaacatgaatgcgccggatttgacagggacaattgacattgaggatactgca 
tctctggtggttgaaaagttttgggattcgtatgttgacaaggaatttagtggaacgaacgaaatgaccatgacaagggagagcttc 

tccag 

ataaaaacttctcataaatattagtggcagattaatattattaaaatcffiggttagataatcaa 
ttttctacaattaaag 

gtggctttcgaaacaagagtcatctacagttggtcagttagcggactttaactttgtggatttgccggcagtagatgagtacaagcat 
atgatcaagagtcaaccaaagcaaaagttagacttgagtattcaagacgaatatcctgcattgcagacgatagtctaccattcga 
aaaagatcaatgcgatmcggtccaatgtfflcagaacttacgagga^ 

ccagaaagacacctgcacaaatagaggacttcttttctgacctagactcaacccaggcgatggaaattctggaactcgacatttc 
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gaagtacgataagtcacaaaacgagttccattgtgctgtagagtacaagatctgggaaaagttaggaattgatgagtggctagct 
gaggtctggaaacaag 

ataaattcctaaattccattmptaatccttcaatattafflaacMcaaatc^^ 
aatatttttcatqtatatatag 

gtcacagaaaaacgaccttgaaagattatacggccggaatcaaaacatgtctttggtatcaaaggaaaagtggtgatgtgacaa 
cctttattggtaataccatcatcattgccgcatgtttgagctcaatgatccccatgg 

SEQ ID No. 6 (fragment of pICH 15477, containing 1 Intron shown in underlined italics) 
Gttttagttttattgcaacaacaacaacaaattacaataacaacaaacaaaatacaaa'caacaacaacatggcacaatttcaac 
aaacaattgacatgcaaactctccaagccgctgcgggacgcaacagcttggtgaatgatttggcatctcgtcgcgtttacgataat 
gcagtcgaggagctgaatgctcgttccagacgtcccaag 

ptaaaacaacamcattcacatatatpaatacmptcattpaptacpaaoaapacacttactacttpttpatpaaapmccpccm 

atacttatctatatcafflcatcafficaaactagtataaaattaaatgatatttate^^ 

ttttpapttapttttotataatatttttccctptttpatpttap 

gttcatttctccaaggcagtgtctacggaacagacactgattgcaacaaacgcatatccggagttcgagatttcctttactcatacgc 
aatccgctgtgcactccttggccggaggccttcggtcacttgagttggagtatctcatgatgcaagttccgttcggctctctgacctac 
gacatcggcggaaacttctccgcgcacctcttcaaaggtaattttctttctctactcaattttctccaagatccaatatttga^ 
ctatagttaaaattaatctctactccattcttgttacctcaggtcgcgattacgttcactgctgcatgc: 

gttttagttttattgcaacaacaacaacaaattacaataacaacaaacaaaatacaaacaacaacaacatggcacaatttcaac 

aaacaattgacatgcaaactctccaagccgctgcgggacgcaacagcttggtgaatgatttggcatctcgtcgcgtttacgataat 

gcagtcgaggagctgaatgctcgttccagacgtcccaaggtaaaacaacatttcattcacatatatgaatacttttgtcattgagtac 

gaagaagacacttactacttgttgatgaaagmccgccffiatactta 

tgmatatgatatcatggaacattaatctatagggaaactgtffi^ 

aaggcagtgtctacggaacagacactgattgcaacaaacgcatatccggagttcgagatttcctttactcatacgcaatccgctgt 
gcactccttggccggaggccttcggtcacttgagttggagtatctcatgatgcaagttccgttcggctctctgacctacgacatcggc 
ggaaacttctccgcgcacctcttcaaaggtaattttcmctctactcaattttctccaagatccaatamgaaga 
aattaatctctactccattcttgttacctcaggtcgcgattacgttcactgctgcatgc 

SEQ ID No. 7: T-DNA region of plCH1 2691, wherein sequence segments have the following 
function: 

Nucleotides 1 to 25: Left border (opposite strand), 

Nucleotides 86 to 1484: Nos promoter-NPTll coding sequence-Nos terminator (on the 
opposite strand), 

Nucleotides 1506 to 1552: AttP recombination site (opposite strand), 
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Nucleotides 1553 to 1599: intron 5' part (opposite strand), 

Nucleotides 1600 to 2022: TVCV RdRP 5' end (opposite strand), 

Nucleotides 2023 to 2809: Arabidopsis actin 2 promoter (opposite strand), 

Nucleotides 2836 to 2903: AttB recombination site, 

Nucleotides 2904 to 2959: intron 3' part, 

Nucleotides 2960 to 7991 : TVCV RdRP 3' part-MP 5' part, 

Nucleotides 7992 to 8168: cr-TMV MP 3' end, 

Nucleotides 8248 to 8967: GFP coding sequence 

Nucleotides 8961 to 9215: cr-TMV 3' untranslated region, 

Nucleotides 9234 to 9497: Nos terminator, 

Nucleotides 9549 to 9473: T-DNA right border (opposite strand): 

tggcaggatatattgtggtgtaaacaaattgacgcttagacaacttaataacacattgcggacgtttttaatgtactggggtggatgc 
aggtcgatctagtaacatagatgacaccgcgcgcgataatttatccte^ 

attgcgggactctaatcataaaaacccatctcataaataacgtcatgcattacatgttaattattacatgcttaacgtaattcaacaga 

aattatatgataatcatcgcaagaccggcaacaggattcaatcttaagaaactttattgccaaatgtttgaacgatctgcttgactcta 

gatccagagtcccgctcagaagaactcgtcaagaaggcgatagaaggcgatgcgctgcgaatcgggagcggcgataccgta 

aagcacgaggaagcggtcagcccattcgccgccaagctcttcagcaatatcacgggtagccaacgctatgtcctgatagcggtc 

cgccacacccagccggccacagtcgatgaatccagaaaagcggccattttccaccatgatattcggcaagcaggcatcgccat 

gagtcacgacgagatcctcgccgtcgggcatacgcgccttgagcctggcgaacagttcggctggcgcgagcccctgatgctcttc 

gtccagatcatcctgatcgacaagaccggcttccatccgagtacgtgctcgctcgatgcgatgtttcgcttggtggtcgaatgggca 

ggtagccggatcaagcgtatgcagccgccgcattgcatcagccatgatggatactttctcggcaggagcaaggtgagatgacag 

gagatcctgccccggcacttcgcccaatagcagccagtcccttcccgcttcagtgacaacgtcgagcacagctgcgcaaggaa 

cgcccgtcgtggccagccacgatagccgcgctgcctcgtcctggagttcattcagggcaccggacaggtcggtcttgacaaaaa 

gaaccgggcgcccctgcgctgacagccggaacacggcggcatcagagcagccgattgtctgttgtgcccagtcatagccgaat 

agcctctccacccaagcggccggagaacctgcgtgcaatccatcttgttcaatcatgcgaaacgatccagatccggtgcagatta 

tttggattgagagtgaatatgagactctaattggataccgaggggaamatggaacgtcagtggagcatttttgacaagaaatattt 

gctagctgatagtgaccttaggcgacttttgaacgcgcaataatggtttctgacgtatgtgcttagctcattaaactccagaaacccg 

cggctgagtggctccttcaacgttgcggttctgtcagttccaaacgtaaaacggcttgtcccgcgtcatcggcgggggtcataacgt 

gactcccttaattctccgctcatggtaccagcttctcgagcgaccctacgcccccaactgagagaactcaaaggttaccccagttg 

gggcacaacaaaaatcaaatctaaatttgtgtaattatgaaaatgaaacttacctttgaagaggtgcgcggagaagtttccgccga 

tgtcgtaggtcagagagccgaacggaacttgcatcatgagatactccaactcaagtgaccgaaggcctccggccaaggagtgc 

acagcggattgcgtatgagtaaaggaaatctcgaactccggatatgcgtttgttgcaatcagtgtctgttccgtagacactgccttgg 

agaaatgaaccttgggacgtctggaacgagcattcagctcctcgactgcattatcgtaaacgcgacgagatgccaaatcattcac 

caagctgttgcgtcccgcagcggcttggagagtttgcatgtcaattgtttgttgaaattgtgccatgttgttgttgWgtatW^ 

attgtaamgttgttgttgttgcaataaaactaaaacttcaaagcggagaggaaaatatatgaatttatataggcgggmatctcttac 

aactttattttcggcctttcaaaaaaataattaaaatcgacagacacgaatcatttcgaccacaggtaaagataacgtgacctggct 

gtcagacagccttttccctcgtgttaactaattHtaaactaattaatcatctcagcccttggattagttctttt 

gtgacctgctcgatccgcgtgttacatgacagctccgtttttttagtggttaacttaaaccgagtcaatccaggcaacgttagtcgte 

ogtggttggcttgttcaattagamcatacaattcaacgtaaWaattcgtmctattagaattgtatcataattaattcagaccg 

gaaagtgtcmcatgatgtgmatggataWatacaataagatacaatgmcatcatattcactattcacgattagtatgtacattaaat 

aatggctactactacatccgaactcgtcaaaacgattctgaatcaattatacatatgctgactcttgcatacataaaaaatagttgttt 

aaatmgtctaactaatgWggtataagtataatgttgagttgagata^ 

aaaacmcttacataatgataacctagatctagatgagatatgtate^ 

aacgcgtggtacgctgcagaattgctcgaagccgcggtgcgggtgccagggcgtgcccttgggctccccgggcgcgtactcca 
cctcacccatctfflattacatgfflgaacttc^ 

aatctggatgtacgtgacattgctcgccatgaaggacacaaggaagctatttacagttatgtgaatcgtttgaaaaggcagcagcg 
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tcctgtgcctgaataccagagggcagctttcaacaactacgctgagaacccgcacttcgtccattgcgacaaacctttccaacagt 

gtgaattgacgacagcgtatggcactgacacctacgctgte^ 

ctactcaggaagaatgtgaaaacttgtttcgcggcctttcaffi^ 

gattggagctacttttcagaagtccggtgataatttaagttttttcWcataatgagagcactctcaattacac 
aattaagtatgtgtgtaaaacgttcmcctgctagtcaacggffi^ 

gtaagtttacgagagtggatacttttactcttttccgtggtgtgtaccataataatgtggatlgcgaagagttttaca^ 

atgcgtggcactacaaaaagacgttagcaatgcttaatgccgagaggaccatcttcaaggataacgctgcgttaaacttttggttcc 

cgaaagtgagagacatggttatcgtccctctctttgacgcttctatcacaactggtaggatgtctaggagagagattatggtgaaca 

aggaWcgtttatacggtcctaaatcacataaaaacgtatcaagctaaggctttaacttacgcaaatgttctgtcctttgtggag 

taggtctagagtgataattaacggtgtcactgccaggtctgaatgggacacagacaaggcaattctaggtccattagcaatgacat 

ttttccttataacaaagttgggtcatgtgcaggatgaaataatcctgaaaaagttccagaagttcgacagaaccaccaatgagctg 

atttggacaagtctctgcgatgccctgatgggggttatlccctcggtcaaggagacgcttgtgcgcggtggttttgtgaaagte 

aacaagccttagagataaaggttcccgagctatactgtacctttgccgacagattggtactacagtacaagaaggcggaggagtt 

ccaatcgtgtgatctttccaaacctctagaagagtcagagaagtactacaacgcattatccgagctatcagtgcttgagaatctcga 

ctctWgacttagaggcgffiaagactttatgtcagcagaagaa^ 

gaagtcagaattgacgttgcctttcaagaaacctacagaagaggaaatctcggagtcgctaaaaccaggagaggggtcgtgtg 

cagagcataaggaagtgttgagcttacaaaatgatgctccgttcccgtgtgtgaaaaatctagttgaaggttccgtgccggcgtatg 

gaatgtgtcctaagggtggtggtttcgacaaattggatgtggacattgctgatttccatctcaagagtgtagatgcagttaaaaaggg 

aactatgatgtctgcggtgtacacagggtctatcaaagttcaacaaatgaagaactacatagattactlaagtgcgtcgctggcag 

ctacagtctcaaacctctgcaaggtgcttagagatgttcacggcgttgacccagagtcacaggagaaatctggagtgtgggatg^ 

aggagaggacgttggtlacttaaacctaatgcgaaaagtcacgcgtggggtgtggcagaagacgccaaccacaagttggttatt 

gtgttactcaactgggatgacggaaagccggtttgtgatgagacatggttcagggtggcggtgtcaagcgattccttgatatattcgg 

atatgggaaaacttaagacgctcacgtcttgcagtccaaatggtgagccaccggagcctaacgccaaagtaattttggtcgatggt 

gttcccggttgtggaaaaacgaaggagattatcgaaaaggtaaacttctctgaagacttgattttagtccctgggaaggaagctt^^ 

aagatgatcatccggagggccaaccaagctggtgtgataagagcggataaggacaatgttagaacggtggattccttcttgatgc 

atccttctagaagggtgtttaagaggttgtttatcgatgaaggactaatgc^ 

atgtgacgtcgcatatgtgtatggggacacaaagcaaattccgtt(^tttgcagagtcgcgaactttccgtatccagcgcattttgca 

aaactcgtcgctgatgagaaggaggttagaagagttacgctcaggtgcccggctgatgttacgtatttccttaacaagaagtatga 

cggggcggtgatgtgtaccagcgcggtagagagatccgtgaaggcagaagtggtgagaggaaagggtgcattgaacccaat 

aaccttaccgttggagggtaaaattttgaccttcacacaagctgacaagttcgagttactggagaagggttacaaggatgtgaaca 

ctgtgcacgaggtgcaaggggagacgtacgagaagactgctattgtgcgcttgacatcaactccgttagagatcatatcgagtgc 

gtcacctcatgttttggtggcgctgacaagacacacaacgtgttgtaaatattacaccgttgtgttggacccgatggtgaatgtgattt 

cagaaatggagaagttgtccaatttccttcttgacatgtatagagttgaagcgggggtccaatagcaattacagatcgatgcagtatt 

cagggacagcaacttgtttgttcagacgcccaagtcaggagattggcgagatatgcaattttactatgacgctctt^ 

cagtactattctcaatgaatttgatgctgttacgatgaatttga 

atccgtgcaacttcctaaagaacaacctattttcctcaagcctaaaataagaactgcggcagaaatgccgagaactgcaggtttg 
ctggaaaatttggttgcaatgatcaaaagaaacatgaatgcgccggatttgacagggacaattgacattgaggatactgcatctct 
ggtggttgaaaagttttgggattcgtatgttgacaaggaatttag^ 

atggctttcgaaacaagagtcatctacagttggtcagttagcggactttaactttgtggatttgccggcagtagatgagtacaag 

atgatcaagagtcaaccaaagcaaaagttagacttgagtattcaagacgaatatcctgcattgcagacgatagtctaccattcga 

aaaagatcaatgcgattttcggtccaatgttttcagaacttacgagg 

ccagaaagacacctgcacaaatagaggacttcttttctgacctagactcaacccaggcgatggaaattctggaactcgacattt^ 
gaagtacgataagtcacaaaacgagttccattgtgctgtagagtacaagatctgggaaaagttaggaattgatgagtggctagct 
gaggtatggaaacaaggacacagaaaaacgaccttgaaagattatacggccggagtcaaaacatgtctttggtatcaaagga 
aaagtggtgatgtgacaacctttattggtaataccatcatcattgra^ 

ggcagctttttgtggagacgatagcctgatttacattcctaaaggtttagacttgcctgatattcaggcgggcgcgaacctcatgt^ 

aacttcgaggccaaactcttcaggaagaagtatggttacttctgtggtcgtlatgttattcaccatgatagaggagccattgtgte 

gatccgcttaaactaatatctaagttaggttgtaaacatattagagatgttgttcacttagaagagttacgcgagtctttgtgtgatgta 

ctagtaacttaaataattgtgcgtattmcacagttagatgaggccgttgccgaggttcataagaccgcggtaggcggttcgWgc^ 

ttgtagtataatteagtafflgtcagate^ 

agtgatttcctcaatcmcgaagaaggaagagatcttgccgaaggctctaacgaggttaaaaaccgtgtctattagte 

attatatctgtcaaggagtcggagacmgtgtgatatagamgttaa 

gagctgtMaccggagagtggctagtgccagacttcgttaaa 
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caaaggagtgcgtgattggtacgtacagagccgcagccaagagtaag 

accgtggacgcaaagaggaagccgtggcaggttcatgttcgtatacaagacttgaagattgaggcgggttggcagccgttagct 

ctggaagtagmcagttgctatggtcaccaataacgttgtcatgaagggtttgagggaaaaggtcgtcgcaataaatgatccg 

gtcgaaggtttcgaaggtgtggttgacgaattcgtcgattcggttgcagcatttaaagcggttgacaactttaaaagaaggaa 

gaaggttgaagaaaagggtgtagtaagtaagtataagtacagaccggagaagtacgccggtcctgattcgtttaatttgaaagaa 

gaaaacgtcttacaacattacaaacccgaataatcgataactcgagtatttttacaacaattaccaacaacaacaaacaacaaa 

caacattacaattacatttacaattatcatggtgagcaagggcgaggagctgttcaccggggtggtgcccatcctggtcgagctgg 

acggcgacgtaaacggccacaagttcagcgtgtccggcgagggcgagggcgatgccacctacggcaagctgaccctgaagtt 

catctgcaccaccggcaagctgcccgtgccctggcccaccctcgtgaccaccttcagctacggcgtgcagtgcttcagccgctac 

cccgaccacatgaagcagcacgacttcttcaagtccgccatgcccgaaggctacgtccaggagcgcaccatcttcttcaaggac 

gacggcaactacaagacccgcgccgaggtgaagttcgagggcgacaccctggtgaaccgcatcgagctgaagggcatcgac 

ttcaaggaggacggcaacatcctggggcacaagctggagtacaactacaacagccacaacgtctatatcatggccgacaagc 

agaagaacggcatcaaggtgaacttcaagatccgccacaacatcgaggacggcagcgtgcagctcgccgaccactaccagc 

agaacacccccatcggcgacggccccgtgctgctgcccgacaaccactacctgagcacccagtccgccctgagcaaagacc 

ccaacgagaagcgcgatcacatggtcctgctggagttcgtgaccgccgccgggatcactcacggcatggacgagctgtacaag 

taaagcggcccctagagcgtggtgcgcacgatagcgcatagtgtttttctctccacttgaatcgaagagatagacttacggtgtaaa 

tccgtaggggtggcgtaaaccaaattacgcaatgtmgggttccamaaatcgaaaccccttatttcctggatcacctgttaacgca 

cgtttgacgtgtattacagtgggaataagtaaaagtgagaggttcgaatcctccctaaccccgggtaggggcccagcggccgctc 

tagctagagtcaagcagatcgttcaaacatltggcaataaagmcttaagattgaatcctgttgccggtcttgcgatgattatcata 

amctgttgaattacgttaagcatgtaataattaacatgtaatgcatgacgttatttatgagatgggttmatgattagagtcc 

atacatttaatacgcgatagaaaacaaaatatagcgcgcaaactaggataaattatcgcgcgcggtgtcatctatgttactagatc 

gaccagcttagatcagattgtcgtttcccgccttcagWaaactatcagtgtttgacaggatatattggcgggtaaac 



